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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re application of: 
STEIMER ef a/. 

Serial Number: National Stage Application 
ofPCT/EPOO/08994 

Filed: March 15,2002 

For: Transcriptionally Silenced Plant Genes 



Art Unit: TBA 
Examiner: TBA 

Atty Docket: S-31147A 



PRELIMINARY AMENDMENT 

BOX PCT 

U.S. Patent and Trademark Office 
P.O. Box 2327 
Arlington, VA 2202 

Sir: 



Applicants respectfully request that the above-captioned application be amended as 
follows in advance of prosecution: 

IN THE SPECIFICATION : 

At page 1, after the title, please insert: 

This application is a § 371 of International Application No. PCT /EP00/08994, 
filed September 14, 2000. 

IN THE CLAIMS 

Please amend claim 5 to read as follows: 

5. (Amended) The nucleic acid of claim 1 comprising an open reading frame 

encoding a protein comprising a component sequence of at least 200 amino acids 
in length being at least 85% identical to an aligned component of SEQ ID NO: 
10. 
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Please add the following new claims: 

14. (New) The nucleic acid of claim 2 comprising an open reading frame encoding 
a protein comprising a component sequence of at least 200 amino acids length 
in length being at least 85% identical to an aligned component of SEQ ID NO: 
10. 

15. (New) The nucleic acid of claim 3 comprising an open reading frame encoding 
a protein comprising a component sequence of at least 200 amino acids in 
length being at least 85% identical to an aligned component of SEQ ID NO: 10. 

16. (New) The nucleic acid of claim 4 comprising an open reading frame encoding 
a protein comprising a component sequence of at least 200 amino acids in 
length being at least 85% identical to an aligned component of SEQ ID NO: 10. 

REMARKS 

Claim 5 has been amended and new claims 14-16 have been added to eliminate 
multiple dependency and to ensure that they are in compliance with the rules of U.S. 
practice. No new matter has been added. 

If in the opinion of the Examiner, a telephone conference would expedite the 
prosecution of the subject application, the Examiner is invited to call the undersigned 
attorney. 



The Commissioner has been authorized to charge the fees associated with the filing 
of the application that this amendment accompanies and associated papers to Applicant's 
credit card. If there are any additional fees required, authorization is hereby provided to 
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charge such fees to Applicant's Deposit Account No. 50-1744 (in the name of Syngenta 
Biotechnology, Inc.). 



Respectfully submitted, 



Syngenta Biotechnology, Inc. 
Patent Department 
P.O. Box 12257 

Research Triangle Park, NC 27709-2257 
(919) 541-8566 
December 21, 2001 



Marcia R. Morton 
Attorney for Applicants 
Reg. No. 46,942 
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Version With Markings To Show Changes Made 

The specification has been amended as follows to include the priority data: 

-This application is a § 371 of International Application No. PCT /EP00/08994, 
filed September 14, 2000.- 

The claims have been amended as follows: 

5. (Amended) The nucleic acid of [any one of claims 1 to 4] claim 1 comprising an 
open reading frame encoding a protein comprising a component sequence of at least 
200 amino acids in length being at least 85% identical to an aligned component of 
SEQIDNO: 10. 

— 14. (New) The nucleic acid of claim 2 comprising an open reading frame 
encoding a protein comprising a component sequence of at least 200 amino acids 
length in length being at least 85% identical to an aligned component of SEQ ID 
NO: 10. 

15. (New) The nucleic acid of claim 3 comprising an open reading frame encoding 
a protein comprising a component sequence of at least 200 amino acids in length 
being at least 85% identical to an aligned component of SEQ ID NO: 10. 

16. (New) The nucleic acid of claim 4 comprising an open reading frame encoding 
a protein comprising a component sequence of at least 200 amino acids in length 
being at least 85% identical to an aligned component of SEQ ID NO: 10.-- 
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TranscriptionallySilenced Plant Genes 



The present invention relates to the field of gene expression in plants and in particular 
concerns gene silencing, a phenomenon frequently observed after integration of transgenes 
into plant genomes. Comparison of transcriptional gene expression between an Arabidopsis 
line carrying a silent transgene present in multiple copies and its mutant derivative mom 1 
impaired in silencing of the transgene revealed two cDNA clones which are expressed in 
the mutant plants, but not in the parental and not in wild type plants. Both clones are 
derived from the same family of transcripts which we refer to as TSI (Transcriptionally Silent 
information). The disclosed genomic templates encoding TSI are repetitive elements with 
mainly pericentromeric location and conserved organization among various ecotypes. They 
are also referred to as TSI. Transcriptional silencing of the genomic TSI templates is 
specifically released in the mutant. Silencing of said templates is further released in other 
genotypes known to affect transcriptional gene silencing. Thus, transcription of TSI can be 
used as a marker to identify a defective silencing pathway in a plant. 



Correct balance between activation and silencing of its genetic information is essential for 
any living cell. A tight control of gene expression is necessary for adaptation to 
environmental factors, regulation of physiological requirements, and development of 
differentiated, specialized cell types within a multi-cellular organism. For example 
differentiation processes involve mitotically heritable changes of gene expression, wherein 
the acquired states of gene activity gain a certain stability. This stability can be achieved by 
the strict control of gene activators, by regulation of transcript stability, or by regulating the 
transcriptional availability of genetic information itself as by stable silencing of selected 
genetic loci. Silencing has been frequently observed in connection with repression of 
transgene expression in various experimental systems. 

In plants, silencing of transgenic loci limits the reliability of transgenic approaches to 
improve quality traits. It has been noticed that complex inserts containing rearranged 
multiple copies of a transgene are particularly prone for gene silencing. Two different 
mechanisms leading to loss of transgene expression are observed. The first prevents 
transcription (transcriptional gene silencing or TGS), and the second targets selected 
transcripts for rapid degradation (posttranscriptional gene silencing or PTGS). Triggers of 
both processes seem to be similar, since the onset of both types of silencing correlates with 
redundancy of genetic information, i.e. DNA repeats in case of TGS and RNA 
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overproduction for PTGS. TGS is meiotically heritable and correlates with DNA template 
modification manifested by hypermethylation of promoters of silenced genes or with local 
changes of chromatin structure. In contrast, PTGS is not meiotically transmitted and needs 
to be reestablished in each sexual generation. PTGS does not require modification of a 
DNA template, however, increased levels of DNA methylation within the protein-coding 
region of silenced genes have been observed. 

The majority of silencing studies in plant systems deal with silencing of transgenes. There 
are only a few examples of gene silencing without involvement of transgenic loci. The 
criteria for TGS susceptibility of genetic information is very poorly understood, and the 
natural targets of transcriptional silencing in a normal, wild type plant are yet to be 
discovered. It has been postulated that TGS is a defense system against invasive DNA 
such as transposable elements but experimental evidence for this hypothesis is lacking. 

Within the context of the present invention reference to a gene is to be understood as 
reference to a DNA coding sequence associated with regulatory sequences, which allow 
transcription of the coding sequence into RNA such as mRNA, rRNA, tRNA, snRNA, sense 
RNA or antisense RNA. Examples of regulatory sequences are promoter sequences, 5' and 
3' untranslated sequences, introns, and termination sequences. 

A promoter is understood to be a DNA sequence initiating transcription of an associated 
DNA sequence, and may also include elements that act as regulators of gene expression 
such as activators, enhancers, or repressors. 

Expression of a gene refers to its transcription into RNA or its transcription and subsequent 
translation into protein within a living cell. 

Any part or piece of a specific nucleotide or amino acid sequence is referred to as a 
component sequence . 

It is the aim of the present invention to provide nucleic acid molecules encoding genetic 
information which is not expressed, i.e. silenced, in wild type plants but whose expression is 
turned on in plants which are defective in transcriptional gene silencing. Said molecules can 
be defined by the formula R a -Rb-Rc wherein 

Ra, Rb and R c indicate component sequences consisting of nucleotide residues 

independently selected from the group of G, A, T and C or G, A, U and C, wherein 

G is Guanosinmonophosphate, 

A is Adenosinemonophosphate, 
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T is Thymidinmonophosphate, 
U is Uridinmonophosphate and 
C is Cytidinmonophosphate; 

R A and R c consist independently of 0 to 6000 nucleotide residues; 

- R B consists of at least 50 nucleotide residues; and 

— the component sequence R B is at least 80% identical to an aligned component 
sequence of SEQ ID NO: 1 , SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID 
NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9 or SEQ ID NO: 27. 

In a preferred embodiment of the present invention R B consists of at least 100 nucleotide 
residues and is at least 85% identical to an aligned component sequence of SEQ ID NO: 1, 
SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, 
SEQ ID NO: 8, SEQ ID NO: 9 or SEQ ID NO: 27. 

In another preferred embodiment R B consists of at least 200 nucleotide residues and is at 
least 90% identical to an aligned component sequence of SEQ ID NO: 1 , SEQ ID NO: 2, 
SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, 
SEQ ID NO: 9 or SEQ ID NO: 27. 

Specific examples of R B are the sequences given in SEQ ID NO: 7, SEQ ID NO: 9 and 
SEQ ID NO: 27. 

Additionally, R A or R c may comprise one or more component sequences with a length of at 
least 50 nucleotide residues and at least 90% identical to an aligned component sequence 
of SEQ ID NO: 1 , SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID 
NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9 or SEQ ID NO: 27. 

The nucleic acid molecules according to the present invention exist either in the form of 
DNA or as RNA. Preferred embodiments are genomic DNA, cDNA, plasmid DNA or RNA 
transcribed therefrom. 

Nucleotides 437-2383 of SEQ ID NO: 1 encode a putative open reading frame of 648 amino 
acids (SEQ ID NO: 10) which in SEQ ID NO: 1 is interrupted by a stop codon spanning 
nucleotides 1631-1633. Nucleic acids encoding a protein comprising a component 
sequence of at least 200 amino acids length being at least 85% identical to an aligned 
component sequence of SEQ ID NO: 10 are a further preferred embodiment of the present 
invention. 
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The nucleic acids according to the present invention represent an endogenous target of the 
transcriptional silencing system. Example 1 describes the cloning of specific embodiments 
of the present invention from Arabidopsis. The preferred size of transcribed RNA is between 
1000 and 6000 nucleotides, particularly transcripts of about 1250, 2500, 4700 and 5000 
nucleotides, which can be polyadenylated or not. The transcriptionally silent information 
present in the genome of wild type plants is found to be only expressed in a range of 
mutants affected in the maintenance of transcriptional silencing. Importantly, not only 
strains affected in transcriptional silencing through alterations of genome-wide DNA 
methylation, but also silencing mutants with unchanged methylation levels which do not 
show striking phenotypic alterations activate TSI, indicating that the release of silencing 
from endogenous templates does not require loss of methylation. 
Initially two independent clones representing RNA which is specifically expressed in 
silencing mutants have been cloned. Anticipating that in wild type plants there are probably 
many more DNA templates suppressed by the silencing system, it is remarkable that parts 
of the two cDNAs cloned are closely related to each other and it is most likely that they are 
parts of the same transcript. The three main TSI transcripts of 5000, 2500 and 1250 
nucleotides all contain a middle element isolated as TSI-A (SEQ ID NO: 5). The 5000 nt and 
the 2500 nt transcripts additionally enclose the second isolated element TSI-B (SEQ ID NO: 
6), which is like TSI-A without protein coding capacity. The 5000 nucleotide long transcript 
further comprises a 5' extension (SEQ ID NO: 1 which is similar to SEQ ID NO: 2) encoding 
a putative open reading frame of 648 amino acids (SEQ ID NO: 10). The two 3 1 extension 
clones of TSI-A (SEQ ID NO: 3 and SEQ ID NO: 4) contain a region which can be aligned 
with nucleotides 1-569 of SEQ ID NO: 6 (nucleotides 808-1397 of SEQ ID NO: 3 and 
nucleotides 819-1365 of SEQ ID NO: 4) closely related to TSI-B (77 % identity). 
Both the 5000 and the 2500 nucleotide transcripts are polyadenylated, while the most 
abundant transcript of 1250 nucleotides is absent from the polyA fraction of mom1 RNA and 
might be retained in the nucleus. 

All RNA species originate from unidirectional transcription, but it is not clear if they represent 
separate transcriptional units regulated by different promoters or if they are processing 
products of the same long transcript. A refined analysis of the TSI expression pattern is 
complicated by the multiplicity of potential chromosomal templates and their location mainly 
in the pericentromeric areas. 



-4- 



WO 01/20010 



PCTYEP00/08994 



The novel TSI sequences do not reveal any putative function by sequence similarity to 
protein- or RNA-coding sequences. The only extensive similarity found was to the 3' half of 
the putative, degenerated retrotransposon Athila (Pelissier et a!. 1995). The other part of 
Athila directly adjacent to the TSI template region was not reactivated in the silencing 
mutant. This suggests that epigenetic transcriptional silencing in Arabidopsis is not directed 
towards retrotransposons in general, although its targets may have originated from 
transposition events. This is further supported by the lack of transcriptional reactivation of 
other Arabidopsis retroelements, e.g. the Ta superfamily (Konieczny et al. 1991). Therefore, 
only specific pericentromeric repeats seem to be under epigenetic control, in the same way 
that only a subset of transgenic loci is susceptible to silencing. The existence of remnants of 
transposons is probably due to their chromosomal location rather than to sequence 
specificity, since degenerated retroelements have repeatedly been found in centromeric 
locations in fungi and plants. 

One of the features proposed as a prerequisite for centromere function is late replication of 
the heterochromatic centromeres and pericentromeric areas in Schizosaccharomyces 
pombe and higher eukaryotes. If this was also true for Arabidopsis centromeres, undue 
loosening of suppressive chromatin leading to TSI expression could cause disturbances in 
mitosis, which would result in severe phenotypes. However, the mom mutant plants exhibit 
no abnormalities suggesting mitotic disorders. Therefore, transcriptional reactivation of 
some usually silent pericentromeric repeats, such as described here, does not impair their 
putative function. Alternatively, their silencing may be important under a specific, still 
undefined condition or on a longer time scale. 

Finally, TSI expression is observed in cells growing for a long time in suspension culture. No 
release of TSI silencing is observed in any tissue of Arabidopsis wild type plants, including 
freshly initiated callus cultures. This suggests that an escape from the silencing control is 
not correlated primarily with dedifferentiation but could be the result of prolonged selection 
for fast growing dedifferentiated cells. Such a loss of silencing control could also underlie 
the accumulation of somaclonal variation during prolonged culture and resembles the 
situation in cells of actively proliferating carcinomas. 

Nucleic acids according to the present invention are particularly useful in selecting plants 
which compared to wild type Arabidopsis plants of all available Arabidopsis ecotypes are 
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impaired in transcriptional gene silencing. A method allowing to select such plants 
comprises 

a) separately preparing RNA of a series of plants; 

b) probing said RNA preparations with a nucleic acid according to the present invention; 
and 

c) identifying a plant whose RNA hybridizes with said nucleic acid. 

In a preferred embodiment the probing step is performed after size fractionation of the RNA 
preparation by gel electrophoresis. For detection the probe is either radioactively labeled or 
labeled by other chemical modifications. 

In another preferred embodiment of said method the step of probing consists of hybridizing 
the RNA with an oligonucleotide primer, extending said primer by reverse transcription and 
subsequent PCR amplification of the DNA generated using oligonucleotide primers specific 
for SEQ ID NO: 1, SEQ ID NO: 5 f SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, or 
SEQ ID NO: 27. Plants which allow for the amplification of DNA fragments flanked by the 
oligonucleotide primers are identified as plants whose RNA hybridizes to the nucleic acid 
according to the invention. 

Having available nucleotide sequence information of a genomic region, which is not 
expressed, i.e. transcriptionally silenced, in a wild type plant, allows to produce DNA 
representing at least part of a gene necessary to maintain silencing of this genomic region. 
Preferably the complete gene is produced. A corresponding method of production 
comprises 

(a) mutagenizing-wild type cells or plants by randomly inserting into their genomes a DNA 
tag with known sequence; 

(b) identifying mutants of said cells or plants which express RNA that is not expressed in 
wild type cells or plants; 

(c) cloning genomic DNA surrounding or close to the insertion site of the DNA tag; 

(d) screening a genomic library of wild type cells or plants with the piece of genomic DNA 
obtained in process step (c) or a part thereof; 

(e) identifying clones comprising at least part of the gene affected by the insertion of the 
DNA tag; and 

(f) further processing the clones obtained in step (e) using recombinant DNA techniques. 
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in plant cells and plants mutagenesis is preferably achieved performing T-DNA insertion 
mutagenesis (Dilkes and Feldmann, 1998), or transposon tagging using the En/I (Pereira 
and Aarts, 1998) or the Ac/Ds system (Long and Coupland, 1998) as described in 
Arabidopsis protocols edited by Martinez-Zapater and Salinas, 1998. Other known physical 
or chemical methods of mutagenesis such as fast neutron irradiation or EMS mutagenesis 
(Feldmann et aL, 1994) might require adaptation of the above method, but can be used for 
the production of equivalent DNA involved in the maintenance of silencing as well. 
A convenient way to identify RNA that is expressed in mutant cells or plants but not in wild 
type cells or plants is reverse transcription of said RNA and subsequent PCR amplification 
of the generated DNA using oligonucleotide primers specific for said DNA (RT-PCR). This 
allows to pool the RNA of upto 1000 mutants which increases the speed of the identification 
step considerably. 

The methods described above can be further elaborated and developed into a kit for the 
identification of plants impaired in transcriptional gene silencing. Such a kit necessarily 
comprises 

a) a nucleic acid according to the present invention conveniently labeled to be used as a 
hybridization probe or 

b) an oligonucleotide primer for reverse transcription of RNA and an oligonucleotide 
primer specific for a nucleic acid according to the present invention. 

The oligonucleotide primer for reverse transcription can be a poly T primer or an 
oligonucleotide primer specific for a nucleic acid according to the present invention. The 
primers specific for nucleic acids according to the present invention are designed to allow 
PCR amplification of DNA templates characterized by the nucleotide sequences disclosed 
in the present invention. 

EXAMPLES 

Example 1 : Differential mRNA screening and cloning of Arabidopsis TSI sequences 
Total RNA of the mutant line mom 1 (Amedeo et al, 2000) and its parental line A is isolated 
according to Goodall et al. (1 990) using 2 g fresh weight of two-week-old seedlings. 
Polyadenylated RNA is obtained using Dynabeads Oligo (dT) 25 (Dynal). 2 jig of poly(A) RNA 
is used for suppression subtractive hybridization (SSH, Diatchenko et al, 1996) using the 
PCR-Select cDNA subtraction kit (Clontech) according to the suppliers' instructions. cDNA 
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derived from the mutant line mom1 is used as tester and cDNA derived from the parental 
line A as driver cDNA population. The subtracted library is cloned into vector pCR2.1 
(Invitrogen). 500 individual bacterial cultures from this library are grown according to the 
manual of the PCR-Select differential screening kit (Clontech). To reduce the number of 
false positive clones, the library is primarily screened by inverted Northern blots as 
described by von Stein et al. (1997). Twelve among the 500 primarily selected cDNA clones 
show increased abundance upon hybridization with labeled mom1 cDNA. Direct Northern 
blot analysis comparing total RNA of the wild type and the mutant line with these 12 cDNAs 
as probes reveal a striking genotype-dependent differential expression for two of them. Said 
clones are sequenced using conventional rhodamine or dRhodamine dye terminators from 
PE Applied Biosystems and a Perkin-Elmer GeneAmp PCR system 2400, 9600 or 9700 
thermocycler. The sequence reactions are analyzed using an ABI PRISM 377 DNA 
sequencer. The cDNA clones are named TSI-A (903 bp, SEQ ID NO: 5) and TSI-B (614 bp, 
SEQ ID NO: 6). Both are abundant in the mom1 RNA but are undetectable in Arabidopsis 
line A and wild type Arabidopsis. No consistent differential expression between mutant and 
wild type is observed for the remaining 10 cDNAs. 

5' and 3' extension reactions are performed using Clontech's Marathon Kit according to the 

manufacturer's instructions. Sequence specific primers are 

TA-F1 : 5 ' -TGGTTCACCAGATAAGCTCAGTGCCCTC-3 ' (SEQ ID NO: 1 1 ) and 

TA-F2: 5 ' - cttcagactggataggactaggtgggcg - 3 ' (SEQ ID NO: 12, nested primer), 

for the 3'extension reaction and 

TA-R1: 5 ' -cgcccacctagtcctatccagtctgaag-3 ' (SEQ ID NO: 13) and 

TA-R2: 5 ' -cgcatcaaacaactaacaacgagggcac-3 - (SEQ ID NO: 1 4, nested primer). 

for the 5'extension. 

PCR amplification products are cloned into vector pCR2.1 (Invitrogen). Individual bacterial 
cultures are grown and subjected to colony PCR as described in the manual of Clontech's 
PCR-Select Differential Screening Kit, with the primer combinations used to create the 
extension reactions (Marathon Adapter primer Ap1 (Clonetech) combined with TA-R2 or TA- 
F2 for the 5'- or 3' extension reactions, respectively). To screen for positive TSI-A extension 
clones, the PCR products are blotted and hybridized to TSI-A. All PCR reactions for cloning 
procedures are performed with a polymerase mix performing proofreading activity 
(Advantage cDNA PCR Kit, Clontech). 
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Since only two transcripts are detected in the polyadenylated RNA fraction of mom1 plants, 
this RNA is used for 5* and 3' extensions reactions starting from the TSI-A sequence. Two 
clones each are analyzed at the nucleotide sequence level. The 5' extension yields inserts 
of 2512 bp (clone a, SEQ ID NO: 1) and 1997 bp (clone b, SEQ ID NO: 2), which after 
alignment are 97 % identical to each other. The clones from the 3' extension have a length 
of 1682 bp (clone c, SEQ ID NO: 3) and 1652 bp (clone d, SEQ ID NO: 4) and are 94 % 
identical. Interestingly, both 3' extension clones of TSI-A contain a region of 569 bp closely 
related to TSI-B (77 % identity). This explains the detection of similar RNA species on 
Northern blots with TSI-A and TSI-B as probes and their hybridization to the same YAC and 
BAC clones, and suggests that TSI-A and TSI-B are part of the same polyadenylated 
transcript species expressed in the mom1 mutant. To confirm that the 5" extensions of TSI- 
A are indeed part of the TSI transcripts, a mom 7 Northern blot is probed with a cDNA 
fragment close to the 5' end of the extension (probe ORF corresponding to nucleotides 943- 
1334 of SEQ ID NO: 1). Interestingly, only the about 5000 nt long transcripts in the poly(A) 
fraction hybridize to this probe. Since this class of transcripts hybridizes to TSI-A and TSI-B, 
the 5000 nt transcripts are probably produced from templates containing a particular order 
of all three sequence elements. 



Example 2: Northern and Southern blot analysis and library screens 
Total RNA is either isolated as described by Goodall et al. (1990) or by the RNeasy Plant 
Mini Kit (Qiagen) according to the suppliers' instructions. For Northern blot analysis, the 
RNA is electrophoretically separated after denaturation by glyoxal in a 1.5 % agarose gel in 
phosphate buffer (pH 7) and blotted to nylon membranes (Hybond N, Amersham) using 
standard protocols. The Boehringer molecular weight marker I is used as a size standard. 
For Southern blot analysis, genomic DNA is isolated according to Dellaporta et al. (1983) 
and separated electrophoretically after endonucleolytic digestion. DNA fragments are 
transferred to nylon membranes (Hybond N, Amersham) according to standard procedures. 
Hybridization and washing of Northern and Southern blots and the filters with the YAC 
library is performed according to Church and Gilbert (1984). Probes are labeled with [a- 32 P]- 
dATP by random prime DNA polymerization (Feinberg and Vogelstein, 1983) and exposed 
to X-ray sensitive film (Kodak X-OMAT AR). 

Hybridization of Northern blots using total RNA prepared from 2-week-old seedlings with 
TSI-A of mom1 visualizes four major transcripts with sizes of approximately 5000, 4700, 
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2500 and 1250 nucleotides. A TSI-B probe detects mainly two transcripts of 5000 nt and 
2500 nt. Interestingly, the polyadenylated fraction of mom1 RNA contains only the 
transcripts of 5000 nt and 2500 nt hybridizing to both cDNA probes (TSI-A and TSI-B). From 
the sizes of TSI transcripts detected it is obvious that both TSI clones represent only partial 
cDNAs. TSI expression is meiotically heritable and persists through 6 selfed generations of 
mom1 with the same pattern of transcripts. 

To examine TSI expression in other genotypes known to affect gene silencing, total RNA of 
several Arabidopsis mutant and transgenic strains known to be affected in gene silencing is 
probed with TSI-A. All the som mutants som1 to som8, described by Mittelsten Scheid et al 
(1998) to be impaired in the maintenance of transcriptional silencing similar to mom1 y show 
a high level of TSI-A expression. The mutation ddm1 t originally identified to have decreased 
DNA methylation (Vongs et al. 1993), and later revealed to release transcriptional gene 
silencing from different loci (Mittelsten Scheid et al, 1998; Jeddeloh et al., 1998) also shows 
a high level of TSI-A expression. TSI-A expression is also expressed in a transgenic line 
described by Finnegan et al (1996) which shows decreased DNA methylation due to 
overexpression of DNA methyltransf erase antisense mRNA as well as in a further 
Arabidopsis mutant affected in the DNA methyltransferase gene (said mutant is referred to 
as ddm2 and has been provided by Eric Richards). Moreover, the silencing mutants hog1 
and s//7, but not sil2 described by Furner et al (1998) express sequences hybridizing to TSI- 
A. Importantly, mutations affecting posttranscriptional gene silencing such as sgsl and 
sgs2 described by Elmayan et al (1998) and egsl described by Dehio and Schell (1994) do 
not express RNA which hybridizes to TSI-A. 

Comparison of patterns of TSI-A expression in the different genotypes reveals genotype 
specific differences in the stochiometry of the different RNA species. mom1 plants reveal a 
different expression pattern of TSI-A and TSI-B as compared to som1 plants. These results 
indicate that a particular genetic deficiency in the transcriptional silencing system leads to a 
differential but specific activation of TSI templates. However, we observe variation in these 
activation patterns between different sources of plant material and different RNA 
preparations. Therefore, it is possible that patterns of TSI expression are more flexible and 
probably also controlled by still unknown factors acting in the mutant background, or by 
different stabilities among the transcript populations. 

TSI-A and TSI-B are used as probes for Southern blots to determine the source of TSI 
transcripts and the organization of their template(s). The blots reveal that multiple copies of 
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TSI-A- and TSI-B-homologous sequences are present in the genome of Arabidopsis. Copy 
numbers are assessed by reconstruction experiments to approximately 130-300 copies of 
TSI-A. 

To examine the degree of evolutionary conservation of the TSI arrangement, DNA of five 
Arabidopsis ecotypes (Zurich, Columbia, Landsberg erecta, Wassilevskija, C24) is 
compared by Southern blot analysis and hybridization to the TSI-A probe. Genomic DNA is 
digested with Dra\ that has a single recognition site within TSI-A and Sspl that does not cut 
within TSI-A. A significant conservation of the TSI-A pattern among different ecotypes is 
observed, with two main Dral repeats of 4 kb and 1 .3 kb and two major Sspl fragments of 
1 1 kb and 4 kb. Some minor differences specific for a particular ecotype indicate a limited 
genetic polymorphism within TSI-A. Probing the same membrane with TSI-B reveals 
complex banding patterns different in each ecotype which might indicate a lower degree of 
conservation for TSI-B, although the differences of the Southern blot patterns between TSI- 
A and TSI-B can also be explained if TSI-A is an internal part of a longer repeated element, 
and TSI-B is located proximal to a flank between repeated elements and variable single 
copy DNA regions. 

After hybridizing TSI-A and TSI-B to the CIC YAC library covering 4 genome equivalents 
and 92% of the Arabidopsis genome sixty-two CIC clones out of 1 152 turn out to hybridize 
with the TSI-A probe. Twenty-six of these contain also the pericentromeric 180-bp-repeat, 7 
contain 5S RNA genes known to be located in the vicinity of a centromere, and 16 clones 
contain other markers that map close to centromeres. Only 4 of these clones map outside 
of centromeric regions. Similar mapping of TSI-B results in hybridization to all TSI-A 
positive CIC clones, with additional 7 clones hybridizing to TSI-B only. Thus both TSI 
repeats are concentrated in the pericentromeric regions of Arabidopsis chromosomes. 

After hybridizing TSI-A and TSI-B to a cDNA library prepared with RNA isolated from mom1 
mutant plants, 22 hybridizing clones are further analyzed by sequence analysis. RNA is 
isolated from 2-week-old seedlings of the mom1 mutant plant according to Goodall et al. 
(1990). The cDNA library is prepared using the Uni-ZAP XR library construction kit 
(Stratagene) according to the manufacturer's protocol. cDNA fragments larger than 500 bp 
are selected using the cDNA size fractionation columns from Gibco. 7 clones contain SEQ 
ID NO: 7 (TSI-A-15), 5 clones contain SEQ ID NO: 8 (TSI-A-2) part of which is identical to 
SEQ ID NO: 4. 
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Example 3: Database searches 

Sequence analysis is performed using the GCG software (Wisconsin Package VersionlO.O, 
Genetics Computer Group (GCG), Madison, Wisconsin). For identity searches, GenEMBL, 
the Arabidopsis thaliana database (http://genome-www3.stanford.edu/), Kazusa 
Arabidopsis opening site (KAOS; http://zearth.kazusa.or.jp/arabi/) and Swissprot are used. 
Peptide sequences are analysed by GeneQuiz (http://columba.ebi.ac.uk:8765/gqsrv/submit) 
or Expasy (http://www.expasy.ch/). 

Searches within the genomic sequence databases GenEMBL, Arabidopsis thaliana 
Database, and KAOS confirm the presence of multicopy sequences related to TSI-A and 
TSI-B, which are distributed over all five chromosomes of Arabidopsis thaliana. Importantly, 
very often single BAC clones contain sequences homologous to both cDNA clones. In some 
cases, TSI-A and TSI-B related sequences are found more than once on the same BAC 
clone, suggesting that TSI-A and TSI-B belong to a clustered repetitive element. 

The significant sequence heterogeneity between the cDNA classes and duplicates of the 5' 
and 3' extensions of TSI-A indicate that they originate from different activated repeats. To 
facilitate the data base search for a possible genomic template of the 5000 nt transcript 
among the multiple related copies, the overlapping cDNA sequences are combined to form 
a continuous 4860 bp sequence of "virtual" cDNA (SEQ ID NO: 9) which is used to search 
the Arabidopsis genomic sequence databases. A particular BAC clone (TAMU BAC T6C20, 
accession number AC005898) has 91 % identity to the combined cDNA sequence. Further, 
the search uncovers a chromosomal DNA stretch (BAC F7N22, accession number 
AF058825) 99% identical to the abundant cDNA A-15 of Example 2 (SEQ ID NO: 7). The 
genomic sequence of the transcribed region 5' to the region defined by SEQ ID NO: 7 is 
given in SEQ ID NO: 27. It is identical to nucleotides 65081-68202 of BAC F7N22. Both 
sequences are located at the pericentromeric region of chromosome five. The TSI 
sequence defined by nucleotides 65080 to 70370 on BAC F7N22 is 54 % identical to the 
retrotransposon-like repeat named Athila. The identity of this sequence as a 
retrotransposon is deduced from Arabidopsis genome sequences around heterochromatic 
regions that are marked by the presence of 180 bp satellite repeats. The 10.5 kb sequence 
of Athila has several characteristics of a retroelement, like long terminal repeats (LTR), a 
polypurine track (PPT) and a primer binding site (PBS) for tRNA priming of the reverse 
transcriptase, but its open reading frames do not share homology with proteins known to be 
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involved in transposition. 

The TSIs map to the 3' terminal part of Athila. TSI-A covers a part of the 3' non-coding 
region of the putative retrotransposon and TSI-B corresponds to the PPT and a part of the 
3' LTR. The sequence of the 5' TSI-A extension encodes a possible open reading frame of 
648 amino acids length (SEQ ID NO: 1 0) with 51 % identity to 604 amino acids of the ORF2 
deduced for Athila. The sequence coding for this ORF is also present on the TAMU BAC 
T6C20, however, the ORFs encoded by the two cDNAs clone a (SEQ ID NO: 1) and clone b 
(SEQ ID NO: 2) and the BAC are interrupted by translational stop codons after 398 amino 
acids (clone a), 83 amino acids (clone b) and 46/465/496/499/549 amino acids respectively 
(BAC T6C20). The ORF2 sequence present on BAC F7N22 is highly degenerated by five 
deletions of 2-31 bp and five insertions of 3-10 bp. This further supports the assumption 
that this sequence is derived from a putative but degenerated retrotransposon. Data base 
searches for proteins similar to the potential product of the 648 amino acids ORF do not 
yield significantly similar polypeptides neither to proteins usually encoded by retroelements 
nor to any other known polypeptides. 

Example 4: RNAse protection assays 

RNase protection assays are performed according to Goodall et al. (1990) with minor 
modifications. To assay the direction of TSI transcription, the pCR2.1 based plasmid 
containing the TSI-A insert is cut by EcoRI creating a fragment of 781 bp which is ligated 
into the vector pGEM-7Zf(+) (Promega). To map the 5' transcription start, the probe is 
generated by amplifying the BAC F7N22 region between positions 64929 and 65567 and 
inserting the product into the pGEM-7Zf(+) vector (Promega). Labeled probes are 
synthesized by in vitro transcription of the linearized plasmid in the presence of [<x- 32 P]-UTP 
using T7 polymerase (Promega) or Sp6 polymerase (Boehringer) and purified by 
electrophoresis (Goodall et al., 1990). Single stranded RNA is cleaved by either 4 ug RNase 
A and 0.6 U RNase T t (RNase A/T assay) or by 20 U RNase T, (RNase T assay). Protected 
fragments are separated on a denaturing 6 % polyacrylamide gel. The dried gel is exposed 
to a Phosphorlmager screen (Molecular Dynamics) and to X-ray sensitive film. 
To determine the polarity of TSI transcription, RNase T and RNase A/T protection assays 
are performed with TSI-A probes of opposite polarity. TSI-A sequences are used as probes 
since TSI-A is present in all transcripts detected on Northern blots. There is no evidence for 
protection of the probe corresponding to the sense strand. This suggests the lack of any 
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TSI antisense RNA and a unidirectional transcription of the TSI templates. Interestingly, 
RNase digestions with a TSI-A antisense probe creates a complex pattern of TSI-A 
protected bands. This suggests that many different but related RNAs hybridize to the probe. 
Since a fragment of 781 nt as expected for the protection of the entire TSI-A probe is 
visible, it can be concluded that TSI-A is part of an activated transcript throughout and not 
an artifact generated by template switch during the SSH procedure. Furthermore, some of 
the protected TSI-A fragments are clearly more abundant than others, suggesting either a 
structural conservation of particular regions of TSI-A within related RNAs, or alternatively a 
higher abundance of certain transcript subspecies. 

The sequence information of BAG F7N22 is used to determine the position of the 
transcription start for the longest TSI transcript. An antisense RNA probe for RNase A/T 
protection is produced spanning the 638 nucleotides between positions 64929 and 65567. 
The probe is hybridized with total RNA from ddm1, som7 and mom 1. In all RNA 
preparations, a fragment of approximately 480 nt (±10 nt) is protected and allowes 
positioning of the TSI transcription start on BAC F7N22 to 65087 (±10 nt) in different 
mutants. 

Example 5: Reverse Transcription PCR (RT PGR) 

Reverse transcription is performed with 1 ^g total RNA from mom1 in the presence of 1 mM 
dNTPs, 4-20 U RNasin (Promega), 1x AM RTase buffer (Boehringer) and 25 U AM reverse 
transcriptase (Boehringer) at 37°C for 1 hour, followed by heat inactivation of the reaction 
mixture. As template for PCR 50 ng reverse transcribed RNA primed by gene specific 
antisense primers (BA-R1 , BA-R2, AT-R1, and TA-R1, see below) or 100 ng genomic DNA 
or 100 ng cDNA are used. PCR is started with 3 min denaturation at 94°C, followed by 30 
amplification cycles (denaturation at 94°C/30 sec, annealing at 62°C/30 sec, and elongation 
at 72°C/30 sec) in the presence of 0.2 mM dNTPs, 0.4 \iM forward and reverse primers, 1x 
Taq DNA polymerase buffer (Boehringer) and 0.25 U Taq DNA polymerase (Boehringer). 
The nucleotide sequences of the primers used for RT-PCR are: 

AT-F1: 5 ' -CGATAACATCGACCGTATTGCTCGCC-3 ' (SEQ ID NO: 15) 
AT-R1 : 5 ' -AACTAGCTCCCATCCGTCTTCGACATCC-3 ' (SEQ ID NO: 1 6) 
AT-F2: 5 ' -TGCATCACACCGGATTGGATTGAC - 3 ' (SEQ ID NO: 17) 
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AT-R2: 


5 


BA-F1: 


5 


BA-R1 : 


5 


BA-F2: 


5 


BA-R2: 


5 


F7-F1: 


5 


F7-R1: 


5 


GS-F1: 


5 


GS-R1 : 


5 



-TCAACCTTTTGCCCCAACAACCACTC-3 ' (SEQ ID NO: 23) 



As a positive control for the PGR reactions, the TA-F1 and TA-R1 primers are used. The 
reverse transcriptase region of the pol gene of Arabidopsis Ty1/copia-like retrotransposon 
family is amplified as described by Konieczny (1991) with immaterial modifications. 

Since the nucleotide sequence of the TSI transcripts TSI-A and TSI-B is related to the 
nucleotide sequence of the 3' half of retrotransposon-like element Athila including the 
second ORF and the 3* LTR, we examined by RT-PCR whether transcription of the 5' part 
of Athila including the first ORF is activated in mom1 plants. Five primer pairs are chosen 
(BA-F1 - BA-R1 ; BA-F2 - BA-R1 ; AT-F1 - AT-R1 ; AT-F2 - AT-R2, F7-F1 - F7R1 ) according 
to the sequence information about Athila and the related parts of BAC T6C20 and BAC 
F7N22. All primer combinations amplified the expected products from genomic template 
DNA, but no PCR product could be obtained from mom1 RNA, regardless, whether cDNA 
synthesis was started from an Athila- or BAC-specific reverse primer or from polyT-primed 
cDNA (data not shown). Activation of TSI therefore is limited to sequences related to the 3' 
part of Athila. 

The two classes of isolated cDNAs share only approximately 50% identity with Athila. To 
directly address the question of whether Athila is expressed, RT-PCR experiments are 
performed with Athila-spec\1\c primers (GS-F1, GS-R1) in the TSI homologous region. 
However, the corresponding fragment cannot be amplified from RNA of mom1 seedlings, 
suggesting that only a subset of Athila-l\ke sequences but not the Athila element itself is 
reactivated in the mutant background. 

To investigate whether other retroelements are transcriptionally activated in mom1, 
degenerated primers in a conserved region of the reverse transcriptase gene used to clone 
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and describe the Ta superfamily of Arabidopsis retrotransposons (Konieczny et al., 1991) 
are used to investigate, if members of the Ta family are transcribed in the mutant 
background. Although the expected 268 bp fragment can be amplified from genomic DNA, 
no amplification is achieved in RT-PCR with mom1 RNA as template. This indicates, that, ir 
spite of the TSI homology to retrotransposons, these elements are not generally activated ii 
the mom1 mutant. 



Example 6: TSI expression after application of stress (salinity, UV-C, pathogen) 

Induction of TSI upon UV-C is tested on Northern blots with RNA samples from 1 week-old- 
seedlings subjected to UV-C treatment of 1 kJ/m 2 or 5 kJ/m 2 which are collected at several 
time points within 1 hour (Revenkova et al., 1999). The effect of osmotic stress is tested on 
Northern blots with RNA from one-week-old seedlings that are transferred for 24 hours to 
medium with NaCI concentrations of 0, 0.04, 0.08 and 0.12 M (Albinsky et al., 1998). To test 
TSI expression upon pathogen stress, RNA of 3-week-old seedlings either mock treated or 
infected with Peronospora is analysed by Northern blot analysis. To verify the appropriate 
pathogen response, induction of PR1 expression is monitored by reprobing the membrane 
with a PR1 probe. 

In young seedlings (2 weeks old) and in different tissues of mature wild type plants (roots, 
shoots, leaves, flowers, siliques), TSI expression cannot be detected. The application of 
various stress treatments namely elevated salinity, UV-C, or pathogen infection, does not 
activate TSI in wild type plants. TSI expression is also not detected in freshly initiated callus 
cultures, and transcriptional suppression of TSI is stable even after several in vitro 
passages of the callus culture. However, the only exception so far are cells derived from 
wild type Arabidopsis (literature) growing for a long time in suspension culture. These cells 
express TSI-A, indicating release of TSI silencing under these conditions. 
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What is claimed i& 



2. 



A nucleic acid having the formula R A -R B -R C , wherein 

- Ra, Rb and R C constitute component sequences consisting of nucleotide residues 
independently selected from the group of G, A, T and C or G, A, U and C, wherein 
G is Guanosinmonophosphate, 
A is Adenosinemonophosphate, 
T is Thymidinmonophosphate, 
U is Uridinmonophosphate and 
C is Cytidinmonophosphate; 

R A and R c consist independently of 0 to 6000 nucleotide residues; 

- R B consists of at least 50 nucleotide residues; and 

- the component sequence R 8 is at least 80% identical to an aligned component 
sequence of SEQ ID NO: 1 , SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO- 4 SEQ ID 
NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO- 9 or 
SEQ ID NO: 27. 

The nucleic acid of claim 1, wherein R B consists of at least 100 nucleotide residues and 
.s at least 85% identical to an aligned component sequence of SEQ ID NO- 1 SEQ ID 
NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO- 7 
SEQ ID NO: 8, SEQ ID NO: 9 or SEQ ID NO: 27. 

3. The nucleic acid of claim 1 , wherein the nucleic acid sequence of R B is the sequence 
given in SEQ ID NO: 7, SEQ ID NO: 9 or SEQ ID NO: 27. 

4. The nucleic acid of claim 1 , wherein R A or R c comprise one or more additional 
component sequences with a length of at least 50 nucleotide residues and at least 90% 
identical to an aligned component sequence of SEQ ID NO: 1 , SEQ ID NO- 2 SEQ ID 
NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO- 8 
SEQ ID NO: 9 or SEQ ID NO: 27. 

5. The nucleic acid of any one of claims 1 to 4 comprising an open reading frame 
encoding a protein comprising a component sequence of at least 200 amino acids 
length being at least 85% identical to an aligned component sequence of SEQ ID NO- 
10. 
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6. A method of selecting a plant which compared to a wild type plant is impaired in 
transcriptional gene silencing, comprising 

a) separately preparing RNA of a series of plants; 

b) probing said RNA preparations with a nucleic acid according to claim 1 ; and 

c) identifying a plant whose RNA hybridizes with said nucleic acid. 

7. The method of claim 6, wherein process steps b) and c) comprise reverse transcription 
of the RNA and subsequent amplification of the generated DNA using oligonucleotide 
primers specific for SEQ ID NO: 1 , SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9 or 
SEQ ID NO: 27. 

8. A method of producing DNA representing at least part of a gene necessary to maintain 
silencing of another gene in a cell or plant, comprising 

(a) mutagenizing wild type cells or plants by randomly inserting into their genomes a 
DNA tag with known sequence; 

(b) identifying mutants of said cells or plants which express RNA that is not expressed 
in wild type cells or plants; 

(c) cloning a piece of genomic DNA surrounding or close to the insertion site of the 
DNA tag; 

(d) screening a genomic library of wild type cells or plants with the piece of genomic 
DNA obtained in process step (c) or a part thereof; 

(e) identifying clones comprising at least part of the gene affected by the insertion of 
the DNA tag; and 

(f) further processing the clones obtained in step (e) using recombinant DNA 
techniques. 

9. The method of claim 8, wherein plants are mutagenized using T-DNA, Ac/Ds or EN/I 
insertions, chemical or physical mutagenesis. 

10. The method of claim 8, wherein mutants expressing RNA that is not expressed in wild 
type cells or plants are identified by reverse transcription of said RNA and subsequent 
amplification of the generated DNA using oligonucleotide primers specific for said DNA 
(RT-PCR). 

11. The method according to claim 10, wherein pools of mutants are analyzed by RT-PCR. 
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12. The method according to claim 8, wherein the complete gene necessary to maintain 
silencing is produced. 

13. A kit for the identification of plants impaired in transcriptional gene silencing comprising 

a) a nucleic acid according to claim 1 conveniently labeled to be used as a 
hybridization probe or 

b) an oligonucleotide primer for reverse transcription of RNA and an oligonucleotide 
primer specific for a nucleic acid according to claim 1 . 
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^ (54) Title: TRANSCRIPTIONALLY SILENCED PLANT GENES 
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<141> 
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<210> 1 
<211> 2512 
<212> IUA 

<213> Arabidopsis tnaliana 



<400> 1 

tcgccgttcc agcttcgcat actctctcac 
ttcgcctcaa catcttcgcc ggagtttctc 
cactcgacca ccggaccggc ttcaccatct 
cattcactgc gcctccattc gtctctttac 
tctaaattcg ccgtttactc gaccacactg 
ctatacccga agccacaata tcactctact 
ggtttagtgt gtgtgtttat ttgaactaac 
ttttcaggga atcaatatga gcaactacag 
cgtcgatgaa gctgaatctt ggtcaactag 
cttcagagcc gagacccaac gctcagtagc 
tagaggaaag agagcgatga ccagcagata 
gtatgagcct gagtcatggc acagagaaac 
agtggaagag tacatcagac ttttcgagct 
ttatgagact ctagcccagc ttaggctact 
ccatcttgag acgctgatgt cttacccgta 
tctctccact ctgcaagtgg agttgtatca 
agggttggga ttcttgactt tttcagttaa 
cttggaagga ttatttggtt ttcccagtgg 
agagttgaag gatttgtggt taaccattgg 
taagagcaac cagattcgaa gccctgtgat 
tctgtacccc agggaatcta caggcaccgt 
tgcactcaag ggtattctcc ggagaacaaa 
tgatacacca ccggtcatgc ttctgttgat 
caccaacggg aggaagaagg tgcgaggagc 
tctgattgca tgtggtgtac ctctcacgtc 
agatcacttg cgtcgttgtg agtttctgga 
caaattcgag cactccctga cccgaacagc 
aaccatactt tagggtgaga acattgactt 
gagcactcca ccgactgatg acaatgtccc 
tgagacggat gaggataggg aggaggagta 
cgtacctcca gcgcaggaga gcaagagctt 



cgtctctcgt ttcactcgac cacttcacac 60 
gccattgtcc gtgcttccgt catctccgtt 120 
ctcaactatc caccattcac tcgacctcgc 180 
tcgactgctc ctcaaaccgc caccgtcttc 240 
ttacgtctct cattcgtgta cagtcgaccg 300 
cgaccgtttc actcgatcgc gtacttgact 360 
atattgatat ttggttttga gttacattct 420 
tggcgaatcc tccatggatg cggattacaa 480 
accagagaga gagcaacagg cttatgagag 540 
tcgacgcaat gaaaggagag ctgagattgc 600 
tgagttgatc gacgaagata ttgacgtcga 660 
aaaactgttg aacaagcctg atgaagttac 720 
gaacgacttc tggggagcga ggtacccctg 780 
ggaggacgta cagcacttat tcgagaagtg 840 
cgtcgcttac aagaaggaaa caatagagtt 900 
gggacttact gcagatgaac tggagagtga 960 
cgagcagcgt taccagctat ctatcaagag 1020 
aaagggaact aaacccaagt tcgaaaggga 1080 
gaacgatttg gcgctcaact ctgcaaggtc 1140 
ccgctactat cagcgctcag tagcgaatgt 1200 
gtctaacaca gacatggaga tgattgattc 1260 
ggggaagaag gtcctaaagg gcgaccttaa 1320 
ccatatgtgt ggatacagga agtgggcgca 1380 
cctttgtgtg ggtggcgttg tgacaccgat 1440 
tccagggttt gatccgagga tgatggattt 1500 
gtacgacatg gttggcgatt tctatcgcta 1560 
caacattttg cttccctgca tcgaggccac 1620 
cagacctgcg cgtgattacc tctactttga 1680 
tacgacggaa gctacagagg atgattttgc 1740 
tgatacgagc atgtatcatt tcagtgagca 1800 
gagtgaagct cacagaaaca acagtaagtt 1860 
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gcagaggtgg tgcaagaaac aagataggtt 
tctaacggac aagataagtt gcttctcttc 
tcaggacatg ccttcgaaga gatatgacgc 
aagtcaccac aggcctgagc ctagtgaccg 
atcattcgag cctcgggagc tcgggagaaa 
caggagtaga cgacttctcc agtcccgtag 
aagaagagag gtcgagtatc atcagagcgg 
gtacccccag ggggaagctg agacacaaca 
acaggcagct attgacgacc aactccgctc 
accattgtaa t^taccatct cttgtttttt 
gtactctctt ccaaatttgg tcacacagtg 



acttatcaag tgcttcaagg ccatcacgtt 1920 
taccacagct attccgcagg gagagcgtcc 1980 
gccagggcca agtcatcaca ggcctgagcc 2040 
agtagtacca ccagtccctg caaggcattc 2100 
gaagaaggct gcactcgcta ggtctggcag 2160 
cttacgcgac cgcggtgctg gccgcagcag 2220 
tgctggccgc ggcgaaggag cagaggtcga 2280 
gggagattct tcgatggcct gggagcaatc 2340 
cttcttccac tgaggtatgc acctcactcc 2400 
attttgtttt tgtgatgtgt tttgtcctga 2460 
gactgtgtga tttaagtttg gg 2512 



<210> 2 
<211> 1997 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 2 

ccgggcaggt caacaggctt atgagagctt 
acgcaatgaa aggagagctg agattgctag 
gttgatcgac gaagatattg acgtcgagta 
gctgttgaac aagcccgatg aagttacagt 
cgacttctag ggaacgaggt acccctgtta 
ggacgtacat cacttattcg agaagtgcca 
cgcttacaag aaggaaacaa tagagtttct 
acttactgca gatgagctgg agagtgaagg 
gcagcgttac cagctatcta tcaagagctt 
gggaactaaa cccaagttcg agagggaaga 
cgatttggca ctcaactctg caaggtctaa 
ctactatcag cgctcagtag cgaatgttct 
taacacagac atggagatga ttgatgttgc 
gaagaaggtc ctaaagggcg accttaatga 
cctgtgtgga tacaggaagt gggcgcacac 
ttgtgtaggt ggcgttgtga caccgattct 
agggtttgat ccgaggatga tggatttaga 
cgacatggtt ggcgatttct atcgctacaa 
cattttgctt ccctgcatcg aggccacaac 
acctgcgcgt gattacctct actttgagag 
gacggaagtt acagaggatg atattgctga 
tacgagcatg tatcatttca gtgagcacgt 
tgaagctcac agaaacaaca gtaagttgca 
tatcaagtgc ttcaaagcca tcacgtttct 
cacagctatt ccgcagggag agcatcctca 
agcgccaagt catcacaggc ctgagccaag 
agtcccacca gtccctgcaa ggcattcatc 
gaaggctgca ctcgctcggt ctggcagcag 
acgcgaccgt ggtgctggcc gcagcagaag 
tggccgcgac gaaggagcag aggtcgagta 
gattcttcga tggcctggga gcaatcacat 
ttccactgag gtatgcacct cactccacca 
gtttttgtga tgtgttttgt cctgagtact 
tgtgatttaa gtttggg 



tagagctgag acccaacgct cagtagctcg 60 
aggaaagaga gcaatgacca gcagatatga 120 
tgaacctgaa tcatggcaca gagaaacgaa 180 
agaggagtac atcagacttt tcgagctgaa 240 
tgagacttta gcccagctgg ggctactgga 300 
tctggagacg ctgatgtctt acccgtacgt 360 
ctccactctg caagtggaga tgtatcaggg 420 
gttggggttc ttgacttttt cagttaacga 480 
ggaaggatta tttggttttc caagtggaaa 540 
gttgaaagat ttgtggttaa ccattgggaa 600 
gagcaaccag attcgaagcc ctgtgatccg 660 
gtaccccagg gaatctacag gcatcgtgtc 720 
actcaagggc attctccgga gaacaaaggg 780 
tacaccaccg gttatgcttc tgttgatcca 840 
caacgagaag aagaaggtgc gaggagccct 900 
gattgcatgt ggtgtacctc tcacgtctcc 960 
tcacttgcgt cgttgtgagt ttctagagta 1020 
attcgagcac tccctgaccc gaacagccaa 1080 
catacttcag ggtgagaaca ttgacttcag 1140 
cgctccaccg actgatgaca atgtccctac 1200 
gacggatgag gatagggagg aggagtatga 1260 
acctccagcg cgggagagca agagcttgag 1320 
gaggtggtgc aagaaacaag ataggctact 1380 
gacggacaag ataagctgct tctcttctac 1440 
ggacatgcct tcaaggagat atgacgcgcc 1500 
tcaccacagg cctgagccta gtgaccgagt 1560 
attcgagcct cgggagctcg ggagaaagaa 1620 
gagtacacga cttctacagt cccgtagctt 1680 
aagagaggtc gagtatcatc agagcggtgc 1740 
ccccacgggg aagctgagac acaacaggga 1800 
gcagctattg acgaccaact ccgctccttc 1860 
ttgtaatata ccatctcttg tttttatttt 1920 
ctcttccaaa tttggtcaca cagtggactg 1980 

1997 
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<211> 1682 
<212> ENA 

<213> Arabidopsis thaliana 



<400> 3 

cttatattat gggttgggat gtgtttaaag 
gggaaagaat tctaggggaa gtaagctaaa 
ggaatgttaa agaaaagaat gaggttcttg 
ggttttaaag attaaaaaaa aaacaggaac 
gatgtatcaa gagcgttgag aaagcttctc 
tgaaaaagag tttgaaagat tcatgagtgc 
ggttttacca ttagaacttc attgttatac 
cataatttgg gacttacctt tagcattcta 
tgttacttaa gcctattcta taagggacca 
atgagttcat tgatgatgca ttgcttgatt 
attggtagat ttgaaagcat gtgtaggtcg 
gcatggctaa cgtttttgag tagaattcaa 
acattgattt tatttgctct atcatatgct 
ccttcaacta tgtcttctta tttgcttgag 
atatgtctat aatttgcatg ttttcagtgt 
tatcattcat cactgtttta tatcatttct 
ataactttgc atacatattg catttctgag 
agcaaattgg aagaaatgag ccagaaccag 
tttggggcca ttcttcccac atactcggcc 
cccatccact cgaccaccgg gtcgagtaac 
gaccccaggt cgagtatcac ttcgccacac 
caagttactc gaccccctgg tcgagtatca 
gacactcgat cacgtcttca cagtctactc 
caaggaagag aagaggagaa gacaaagtgc 
cgaagcccat ctcggcccat tatctctcta 
tactatcatt ttatttcgtt ttgtataaat 
tagtcgacat tgagtttttt ttgcttcagt 
ttttgcttct gcaacctgta attcgagatt 
tt 



aaaaggggga attcattgtt gataaggaaa 60 
gaagttagaa aaaatctagt aaaggttttg 120 
ttagctaaag aataagggtt aaaagccttt 180 
cttagttgtt aaagaaatcc aaacccgcta 240 
ctagagttaa gagaaaagaa aagaatgata 300 
aaagggtaga gttaagttgg gacaggagtt 360 
tctgggtaga tgggatctta tctctgtatg 420 
ctaaagctca atcattcttg agggatcccc 480 
tctttgtctc ttgaccttca ccttggccga 540 
cgcgttccag aactaatgaa tgttaaaggg 600 
agtataagag acggattgat tgaaaacaag 660 
tcatatcgca tcttagaact accaacttgg 720 
ttggttttga gtccccgcct tcactcctct 780 
ggcaagcaaa gactaagttt gagggagttg 840 
ccattcatca tcgttttgag tccagtttcg 900 
catcattctt gcatactttg catgattagg 960 
ttgttttcag gtgatttgga gctgtttgca 1020 
aagacatact cgacccctag gtcgagtgac 1080 
cccaggtcga gtgactttgg agccattctt 1140 
cttagctcag gccactcgat gacactactc 1200 
cacctgacaa cactcgacca atcactctac 1260 
tcactcacca ccatcagcat cactcgaccg 1320 
aaatccgcag tcaaccagac aagctgagca 1380 
ttggaagcgg cctggacctc catcggatca 1440 
tgggccgagc gattaggtta ttggcccgtc 1500 
agatgtctta gggttttgtc ctgagacatc 1560 
tttattttct gttctactct gctgcgccgc 1620 
tttccaagtt attcagattc cgcatttgat 1680 

1682 



<210> 4 
<211> 1652 
<212> DNA 

<213> Arabidopsis thaliana 



<400> 4 

cttatatcat gggtttggat cagtttaaaa 
gaaagggaaa gaattctagg ggaagtaagc 
aaggttttgg gaatgttaaa gaaaagaatg 
aaagcctttt gttttaaaga ttaaaaacag 
gctagatgta tcagagtgtt gagaaagctt 
atatgaaaaa gagtttgaaa gattcatgag 
ggactggagt tgggattacc attagagctt 
atctctgtat gcataacttg ggacttacct 
gagagatccc ctgttactta agcctattct 
accttagcca aatgagttca fctgatgatgc 
atgttaaagg gattggtaga tttgaaagca 
ttgataacaa ggcatggcta acgttttcga 
taccaacttg gacattgatt ttatttgctc 
ttcaaacctc tccttcaact atgtcttctt 



aaaaaaaagg gtgaattcat tgttgataag 60 
taaagaagtt agaaaaaaaa aaatctagta 120 
aggttcttgt tagctaaaga agaagggtta 180 
gaaccttagt tgttaaagaa atccaaatac 240 
ctcctagagt taagagaaaa gaaaagaatg 300 
tgcaaagggt agagttaagt tcttgtattg 360 
cattgttata ctatgggtag atgggatttt 420 
ttagcattct actaaagctc aatcattctt 480 
gtaagggacc atctttgtct cttgaccttc 540 
attgtttgat tcacgttcca gaactaatga 600 
tgtgtaggtc gagtataaga gacggattga 660 
gtaaaattca atcatatcgc atcttagaac 720 
tatcagatgc tttggttctg agtccccacc 780 
atttgcttga gggcaagcaa agactaagtt 840 
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tgggggagtt gatatgtcta taatttgcat 
gtccagtttc gtatcattca tcactgtttt 
gcatgattag gatagctttg tatacatatt 
agctgtttgc gagcaaattg gaagaaacga 
ggtcgagtga ctttggagcc attcttccca 
gctcaggcca ctcgatgacg ccactcgt.cc 
cctgaccaca ctcggccgtt cactctacca 
actcaccacc aacaccatca ctcgaccggg 
atccgcactc aaccagacaa gctgagcaca 
ggaagcggcc tggacctcca tcggatcacg 
atgggccggg cgattaggtt attggcccgt 
tagatgtctt agggttctgt accaggacat 
tttattttct gttttctctg ctgcgccgct 
ttccaagtta ctcagattcc gcatttgatt 



gttttcagtg tccattcatc atcgttttga 900 
atatcatttc tcatcattct tgcatacttt 960 
gcatttctga gttgttttta ggtgatttgg 1020 
gccagaacaa gaagccatac tcgaccccct 1080 
tctactcgac ccgggggtcg agtaacctca 1140 
cccctggtcg agtatcactt cgccacacca 1200 
cgttactcga ccccctggtc gagtatcatc 1260 
cactcgatca catcttcata gtctactcaa 1320 
aggaagagaa gaggagaaga caaagtgttt 1380 
aagaagccca tctcggccca ttatcattct 1440 
ctactatcat tttatttcgt tatgtataaa 1500 
ctagtcgaca ttgagttttt ttgcttcagt 1560 
tttgtttctg caacctgtaa ttcgagattt 1620 
tc 1652 



<210> 5 
<211> 903 
<212> ena 

<213> Arabidopsis thaliana 
<400> 5 

tactctcttc caaatttggt cacacagtgg 
aggaagtgtg tgttgcattg tataatcttg 
caaaaaaatt gaaaaattcc agaaaatgat 
attgcattta ggatcgagtt tagagtgttt 
taataatgag atagccttgt aagcattttg 
ttagttgttt gatgcgttgt cattgaaatt 
tctactcgac cacactgtta ggatctgata 
atttagaatt atcatgtctt ggcatcgaat 
attttcttac tcattttaac cactcttgtt 
cagttaaccc atacccaaac ctgaactttc 
gtgaggtctt atttcgattg agcttggtag 
tgtctcatgt agttctagtt tgcgttcttc 
tcatgggttg ggatgtgttt aaaagaaaag 
agaattccag gggaagtaag ctaaagaagt 
atgttaaaga aaagaatgag gttcttgtta 
ttt 



actgtgtgat ttaagtttgg gggagggctc 60 
agtttgcatt catctaaggc atagaaaaac 120 
ttcacaaaaa tagagtgttc atgtagttgc 180 
cgtttaggat tgttgcatat gcatagggga 240 
gttcaccaga taagctcagt gccctcgttg 300 
gaagtaagaa ctgcacgatg cctagattgc 360 
tcattcccta tcaatttgaa cttgaatctg 420 
ttgaactcat ggatacccta aaatacttgg 480 
gatccaagta gctgactctc cttattagag 540 
tttcaagccc tatatcactt gtgagtgttt 600 
aaagtgttag gttcgtaacg acagagatag 660 
agactggata ggactaggtg ggcgcttata 720 
agggaatcta ttgttgatga ggaaagggaa 780 
tagaaaaaaa atctagtaaa ggttttggga 840 
gctaaagaag aagggttaaa agcctttggt 900 

903 



<210> 6 
<211> 614 
<212> ENA 

<213> Arabidopsis thaliana 
<400> 6 

gagggcaagc aaagactaag tttgggggag 
catccatttg tcatcacttt agcatcatat 
tgtcatcact ttgcatgttt aggatagttt 
caggtgattt ggagctgttg acgagctatc 
actcgaccca gaggtcgagt aggagcttca 
cgagtagagc acatcaccac ttcacctcac 
tccatcacca gacggtcact cgatcacttc 
ccatcatcag acaaccactc gacctcctca 
accactcgac tgcatactcg atgacaagct 
gacactcgag cacaaggaag aaaagaagac 



ttgataagtg tgtattttgc atgttttgag 60 
catcactgtt ttataccatt tcacatcatt 120 
tgcatgcatg ttgcatattt gcgttgattt 180 
tggaagaagc agacctgatc atgacaaacc 240 
agate tcaag agactactcg acaaccaggt 300 
cactcgaccc cgaggtcgag tgccatcatc 360 
actcgacctt gaggtcgagt gtcttcacct 420 
ctccacctag aggtcgagta tctccatctt 480 
teagagcett cttaattccg cactcaacca 540 
tccagctatt cactcgagct ctcactcgac 600 



-4- 
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cacgtgggtc gagt 614 



<210> 7 
<211> 1956 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 7 

ttttggttca ccggattaac tcagtgctct cgttgctagt tgtgtgttgc gtagtgaatg 60 
aatttgaaag aaaactgaac catgcctaga ttgctctact cgaccacact gtcatgatct 120 
gataccattc cctatcaatt tgaacctgaa tttgatcttt aattatcatg tctgcatcaa 180 
atttgaactc atggataccc taaaatactt ggattttctt attcattttg atcactcttg 240 
ttaatccaag tagctgactc tccttattag agcagttaac ccgaacccaa acctagactt 300 
tttttcaagc cttatatcac tcgtgagggt ttgtgaggtc ttattccgat tcagcttggt 360 
agaaagtgtt aggttcgtaa cgacagagat agtgnctcat gtagttctag ttcgcatttt 420 
ttggactaga taggactggg tgggcgctta tactttaggt tgggatgngt ttaaaagaaa 480 
aaaaaagggg ttgattcatt gatgagaaaa ggtaaaagac tctaggtgaa gtaagataaa 540 
gaagcagaaa aggtctagta aaggttttgg gatttgtaaa aaaaagaaag agttcttgtt 600 
agctattgaa gatgggcaaa agccctcggt tttaaaatgt taaaaacagg aaccttagtt 660 
gttaaagaaa tccaaatccg ctagatgtat caaagtgttg agaaagcttt tcctagagtt 720 
aagagaaaag aaaagaatga ttagaaaaag ggcttaaagg attcatgaat gcaaagggta 780 
gaggtaagtt cttatactgg gattggagat gggattacca ttagagcttc atctgatata 840 
ctctaggtag atgggatctt atctctgcat gcatagtttg ggacttacct ttagcattct 900 
actaaagctt aatcattttt tgagagatcc cctgttactg aagcctattc tgtaagggac 960 
catctttgtc tcttgacctt ttaccttagc caaatgagtt cattgatgat gcattgcttg 1020 
attcacgttc cagaactaat gaatgttaaa gggattggta gatttgaaaa catgtgtagg 1080 
togagcatat gagtcggatt gattgatagt aaggcatggc taaagttttt cagtagaatt 1140 
cgatcatatc gcagcttaga actatcaact tggacattga tttcatttgg tttatctagt 1200 
gctttggctc tgagtccccg atttcaaacc tcacctctag cttgttctta attgtttgct 1260 
tgagggcaag caaagactaa gtttggggga gttgataagt gtgtattttg catgttttga 1320 
gcatccattt gtcatcactt tagcaccata tcatcactat tttataccat ttctcatcat 1380 
ttgtcatcac tttgcatgtt taggatagtt ttgcatgcat gtggcatatt tgtgttgttt 1440 
tcaagtgatt cggagctgtt gaagaactaa ttggaagaag cggacctgat catgccaaac 1500 
cactcgacct caggtcgagt agacgcttca cgacctcaac acaccactcg accacctggt 1560 
cgagtgtagg acttcaccac ttcacctcat cactcgaccc cctggccgag taccccacga 1620 
gagtcactcg atcacttcac tcgaccccca ggtcgagtgt cttcacctcc accacctgac 1680 
catcactcga tcacacgact ctacctggaa gtcgagtatc accatcacca ccactcgact 1740 
acatacttga tgtcgagctt cagagtcttc tccattccgc actcaaccag acactcgagc 1800 
acaaggaaaa aaagaagatt ctagcttatc actcgacctc tcactcgacc acctgggtcg 1860 
agtacagttc ttaatccgtc tcaatactgc gtcgttttga gtattagggt ttcggaatat 1920 
ttttgctata agtagcacgt actttacatt ttcgag 1956 



<210> 8 
<211> 2105 
<212> ENA 

<213> Arabidopsis thaliana 
<400> 8 

ttagcatttt ggttcactag ataaactcag tgccctcgtt gttagttgtc tgatgcatac 60 
tcaatgaaat tgaagtaaaa ctgcaccatg cctagattgc tctactcgac cacactgtta 120 
ggatctgata ccattcccta tcaatttgaa cttgaatctg atttagaatt atcatgtctt 180 
gccatcgaat ttgaactcat ggatacccta aaatacttgg attttcttac tcattttaac 240 
cactcttgtt aatccaagta gctgactctc cttattagag cagttaaccc gaatccaaac 300 
ctaatctttc tttcgagccc tatatcactt gtgagtgttt gtgaggtctt atttcaattg 360 
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agcttggtag aaagtgttag gttcgtaacg 
cgcgtttttt ggactggata ggactaggtg 
aaaaaaaaaa aaggg t gaat tcattgttga 
aagctaaaga agttagaaaa aaaaaaatct 
aatgaggttc ttgttagcta aagaagaagg 
acaggaacct tagttgttaa agaaatccaa 
gcttctccta gagttaagag aaaagaaaag 
tgagtgcaaa gggtagagtt aagttcttgt 
gcttcattgt tatactatgg gtagatggga 
acctttagca ttctactaaa gctcaatcat 
ttctgtaagg gaccatcttt gtctcttgcc 
tgcattgttt gattcacgtt ccagaactaa 
gcatgtgtag gtcgagtata agagacggat 
cgagtaaaat tcaatcatat cgcatcttag 
ctctatcaga tgctttggtt ctgagtcccc 
cttatttgct tgagggcaag caaagactaa 
catgttttca gtgtccattc atcatcgttt 
tttatatcat ttctcatcat tcttgcatac 
attgcatttc tgagttgttt ttaggtgatt 
cgagccagaa caagaagcca tactcgaccc 
ccatctactc gacccggggg tcgagtaacc 
tcccccctgg tcgagtatca cttcgccaca 
ccacgttact cgaccccctg gtcgagtatc 
gggcactcga tcacatcttc atagtctact 
acaaggaaga gaagaggaga agacaaagtg 
acgaagaagc ccatctcggc ccattatcat 
cgtctactat cattttattt cgttatgtat 
catctagtcg acattgagtt tttttgcttc 
gcttttgttt ctgcaacctg taattcgaga 
atttc 



acagagatag tgtctcatgt agttctagtt 420 
ggcgcttata tcatgggttt ggatcagttt 480 
taaggaaagg gaaagaattc taggggaagt 540 
agtaaaggtt ttgggaatgt taaagaaaag 600 
gttaaaagcc ttttgtttta aagattaaaa 660 
atacgctaga tgtatcagag tgttgagaaa 720 
aatgatatga aaaagagttt gaaagattca 780 
attgggactg gagttgggat taccattaga 840 
ttttatctct gtatgcataa cttgggactt 900 
tcttgagaga tcccctgtta cttaagccta 960 
ttcaccttag ccaaatgagt tcattgatga 1020 
tgaatgttaa agggattggt agatttgaaa 1080 
tgattgataa caaggcatgg ctaacgtttt 1140 
aactaccaac ttggacattg attttatttg 1200 
accttcaaac ctctccttca actatgtctt 1260 
gtttggggga gttgatatgt ctataatttg 1320 
tgagtccagt ttcgtatcat tcatcactgt 1380 
tttgcatgat taggatagct ttgtatacat 1440 
tggagctgtt tgcgagcaaa ttggaagaaa 1500 
cctggtcgag tgactttgga gccattcttc 1560 
tcagctcagg ccactcgatg acgccactcg 1620 
ccacctgacc acactcggcc gttcactcta 1680 
atcactcacc accaacacca tcactcgacc 1740 
caaatccgca ctcaaccaga caagctgagc 1800 
tttggaagcg gcctggacct ccatcggatc 1860 
tctatgggcc gggcgattag gttattggcc 1920 
aaatagatgt cttagggttc tgtaccagga 1980 
agttttattt tctgttttct ctgctgcgcc 2040 
tttttccaag ttattcagat tccgcatttg 2100 

2105 



<210> 9 
<211> 4860 
<212> ENA. 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Virtual TSI 



<400> 9 

tcgccgttcc agcttcgcat actctctcac 
ttcgcctcaa catcttcgcc ggagtttctc 
cactcgacca ccggaccggc ttcaccatct 
cattcactgc gcctccattc gtctctttac 
tctaaattcg ccgtttactc gaccacactg 
ctatacccga agccacaata tcactctact 
ggtttagtgt gtgtgtttat ttgaactaac 
ttttcaggga atcaatatga gcaactacag 
cgtcgatgaa gctgaatctt ggtcaactag 
cttcagagcc gagacccaac gctcagtagc 
tagaggaaag agagcgatga ccagcagata 
gtatgagcct gagtcatggc acagagaaac 
agtggaagag tacatcagac ttttcgagct 
ttatgagact ctagcccagc ttaggctact 
ccatcttgag acgctgatgt cttacccgta 



cgtctctcgt ttcactcgac cacttcacac 60 
gccattgtcc gtgcttccgt catctccgtt 120 
ctcaactatc caccattcac tcgacctcgc 180 
tcgactgctc ctcaaaccgc caccgtcttc 240 
ttacgtctct cattcgtgta cagtcgaccg 300 
cgaccgtttc actcgatcgc gtacttgact 360 
atattgatat ttggttttga gttacattct 420 
tggcgaatcc tccatggatg cggattacaa 480 
accagagaga gagcaacagg cttatgagag 540 
tcgacgcaat gaaaggagag ctgagattgc 600 
tgagttgatc gacgaagata ttgacgtcga 660 
aaaactgttg aacaagcctg atgaagttac 720 
gaacgacttc tggggagcga ggtacccctg 780 
ggaggacgta cagcacttat tcgagaagtg 840 
cgtcgcttac aagaaggaaa caatagagtt 900 
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tctctccact ctgcaagtgg agttgtatca 
agggttggga ttcttgactt tttcagttaa 
cttggaagga ttatttggtt tfccccagtgg 
agagttgaag gatttgtggt taaccattgg 
taagagcaac cagattcgaa gccctgtgat 
tctgtacccc agggaatcta caggcaccgt 
tgcactcaag ggtattctcc ggagaacaaa 
tgatacacca ccggtcatgc ttctgttgat 
caccaacggg aggaagaagg tgcgaggagc 
tctgattgca tgtggtgtac ctctcacgtc 
agatcacttg cgtcgttgtg agtttctgga 
caaattcgag cactccctga cccgaacagc 
aaccatactt tagggtgaga acattgactt 
gagcactcca ccgactgatg acaatgtccc 
tgagacggat gaggataggg aggaggagta 
cgtacctcca gcgcaggaga gcaagagctt 
gcagaggtgg tgcaagaaac aagataggtt 
tctaacggac aagataagtt gcttctcttc 
tcaggacatg ccttcgaaga gatatgacgc 
aagtzcaccac aggcctgagc ctagtgaccg 
atcattcgag cctcgggagc tcgggagaaa 
caggagtaga cgacttctcc agtcccgtag 
aagaagagag gtcgagtatc atcagagcgg 
gtacccccag ggggaagctg agacacaaca 
acaggcagct attgacgacc aactccgctc 
accattgtaa tataccatct cttgtttttt 
gtactctctt ccaaatttgg tcacacagtg 
caggaagtgt gtgttgcatt gtataatctt 
ccaaaaaaat tgaaaaattc cagaaaatga 
cattgcatztt aggatcgagt ttagagtgtt 
ataataatga gatagccttg taagcatttt 
gttagttgtt tgatgcgttg tcattgaaat 
ctctactcga ccacactgtt aggatctgat 
gatttagaat tatcatgtct tggcatcgaa 
gattttctta ctcattttaa ccactcttgt 
gcagttaacc catacccaaa cctgaacttt 
tgtgaggtct tatttcgatt gagcttggta 
gtgtctcatg tagttctagt ttgcgttctt 
atcatgggtt gggatgtgtt taaaagaaaa 
aagaattcca ggggaagtaa gctaaagaag 
aatgttaaag aaaagaatga ggttcttgtt 
ttttaaagat taaaaaaaaa acaggaacct 
tgtatcaaga gcgttgagaa agcttctcct 
aaaaagagtt tgaaagattc atgagtgcaa 
ttttaccatt agaacttcat tgttatactc 
taatttggga cttaccttta gcattctact 
ttacttaagc ctattctata agggaccatc 
gagttcattg atgatgcatt gcttgattcg 
tggtagattt gaaagcatgt gtaggtcgag 
atggctaacg tttttgagta gaattcaatc 
attgatttta tttgctctat catatgcttt 
ttcaactatg tcttcttatt tgcttgaggg 
atgtctataa tttgcatgtt ttcagtgtcc 
tcattcatca ctgttttata tcatttctca 
aactttgcat acatattgca tttctgagtt 
caaattggaa gaaatgagcc agaaccagaa 



gggacttact gcagatgaac tggagagtga 960 
cgagcagcgt taccagctat ctatcaagag 1020 
aaagggaact aaacccaagt tcgaaaggga 1080 
gaacgatttg gcgctcaact ctgcaaggtc 1140 
ccgctactat cagcgctcag tagcgaatgt 1200 
gtctaacaca gacatggaga tgattgattc 1260 
ggggaagaag gtcctaaagg gcgaccttaa 1320 
ccatatgtgt ggatacagga agtgggcgca 1380 
cctttgtgtg ggtggcgttg tgacaccgat 1440 
tccagggttt gatccgagga tgatggattt 1500 
gtacgacatg gttggcgatt tctatcgcta 1560 
caacattttg cttccctgca tcgaggccac 1620 
cagacctgcg cgtgattacc tctactttga 1680 
tacgacggaa gctacagagg atgattttgc 1740 
tgatacgagc atgtatcatt tcagtgagca 1800 
gagtgaagct cacagaaaca acagtaagtt 1860 
acttatcaag tgcttcaagg ccatcacgtt 1920 
taccacagct attccgcagg gagagcgtcc 1980 
gccagggcca agtcatcaca ggcctgagcc 2040 
agtagtacca ccagtccctg caaggcattc 2100 
gaagaaggct gcactcgcta ggtctggcag 2160 
cttacgcgac cgcggtgctg gccgcagcag 2220 
tgctggccgc ggcgaaggag cagaggtcga 2280 
gggagattct tcgatggcct gggagcaatc 2340 
cttcttccac tgaggtatgc acctcactcc 2400 
attttgtttt tgtgatgtgt tttgtcctga 2460 
gactgtgtga tttaagtttg ggggagggct 2520 
gagtttgcat tcatctaagg catagaaaaa 2580 
tttcacaaaa atagagtgtt catgtagttg 2640 
tcgtttagga ttgttgcata tgcatagggg 2700 
ggttcaccag ataagctcag tgccctcgtt 2760 
tgaagtaaga actgcacgat gcctagattg 2820 
atcattccct atcaatttga acttgaatct 2880 
tttgaactca tggataccct aaaatacttg 2940 
tgatccaagt agctgactct ccttattaga 3000 
ctttcaagcc ctatatcact tgtgagtgtt 3060 
gaaagtgtta ggttcgtaac gacagagata 3120 
cagactggat aggactaggt gggcgcttat 3180 
gagggaatct attgttgatg aggaaaggga 3240 
ttagaaaaaa aatctagtaa aggttttggg 3300 
agctaaagaa gaagggttaa aagcctttgg 3360 
tagttgttaa agaaatccaa acccgctaga 3420 
agagttaaga gaaaagaaaa gaatgatatg 3480 
agggtagagt taagttggga caggagttgg 3540 
tgggtagatg ggatcttatc tctgtatgca 3600 
aaagctcaat cattcttgag ggatcccctg 3660 
tttgtctctt gaccttcacc ttggccgaat 3720 
cgttccagaa ctaatgaatg ttaaagggat 3780 
tataagagac ggattgattg aaaacaaggc 3840 
atatcgcatc ttagaactac caacttggac 3900 
ggttttgagt ccccgccttc actcctctcc 3960 
caagcaaaga ctaagtttga gggagttgat 4020 
attcatcatc gttttgagtc cagtttcgta 4080 
tcattcttgc atactttgca tgattaggat 4140 
gttttcaggt gatttggagc tgtttgcaag 4200 
gacatactcg acccctaggt cgagtgactt 4260 



WO 01/20010 



PCT/EPOO/08994 



tggggccatt cttcccacat actcggcccc caggtcgagt gactttggag ccattcttcc 4320 
catccactcg accaccgggt cgagtaacct tagctcaggc cactcgatga cactactcga 4380 
ccccaggtcg agtatcactt cgccacacca cctgacaaca ctcgaccaat cactctacca 4440 
agttactcga ccccctggtc gagtatcatc actcaccacc atcagcatca ctcgaccgga 4500 
cactcgatca cgtcttcaca gtctactcaa atccgcagtc aaccagacaa gctgagcaca 4560 
aggaagagaa gaggagaaga caaagtgctt ggaagcggcc tggacctcca tcggatcacg 4620 
aagcccatct cggcccatta tctctctatg ggccgagcga ttaggttatt ggcccgtcta 4680 
ctatcatttt atttcgtttt gtataaatag atgtcttagg gttttgtcct gagacatcta 4740 
gtcgacattg agtttttttt gcttcagttt tattttctgt tctactctgc tgcgccgctt 4800 
ttgcttctgc aacctgtaat tcgagatttt tccaagttat tcagattccg catttgattt 4860 

<210> 10 
<211> 648 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 10 

Met Ser Asn Tyr Ser Gly Glu Ser Ser Met Asp Ala Asp Tyr Asn Val 
15 10 15 

Asp Glu Ala Glu Ser Trp Ser Thr Arg Pro Glu Arg Glu Gin Gin Ala 
20 25 30 

Tyr Glu Ser Phe Arg Ala Glu Thr Gin Arg Ser Val Ala Arg Arg Asn 
35 40 45 

Glu Arg Arg Ala Glu He Ala Arg Gly Lys Arg Ala Met Thr Ser Arg 
50 55 60 

Tyr Glu Leu He Asp Glu Asp He Asp Val Glu Tyr Glu Pro Glu Ser 
65 70 75 80 

Trp His Arg Glu Thr Lys Leu Leu Asn Lys Pro Asp Glu Val Thr Val 
85 90 ~ 95 

Glu Glu Tyr He Arg Leu Phe Glu Leu Asn Asp Phe Trp Gly Ala Arg 
100 105 110 

Tyr Pro Cys Tyr Glu Thr Leu Ala Gin Leu Arg Leu Leu Glu Asp Val 
115 120 125 

Gin His Leu Phe Glu Lys Cys His Leu Glu Thr Leu Met Ser Tyr Pro 
130 135 140 

Tyr Val Ala Tyr Lys Lys Glu Thr He Glu Phe Leu Ser Thr Leu Gin 
145 150 155 160 

Val Glu Leu Tyr Gin Gly Leu Thr Ala Asp Glu Leu Glu Ser Glu Gly 
165 170 175 

Leu Gly Phe Leu Thr Phe Ser Val Asn Glu Gin Arg Tyr Gin Leu Ser 
180 185 190 

He Lys Ser Leu Glu Gly Leu Phe Gly Phe Pro Ser Gly Lys Gly Thr 
195 200 205 
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Lys Pro Lys Phe Glu Arg Glu Glu Leu Lys Asp Leu Trp Leu Thr lie 
210 215 220 

Gly Asn Asp Leu Ala Leu Asn Ser Ala Arg Ser Lys Ser Asn Gin lie 
225 230 235 240 

Arg Ser Pro Val lie Arg Tyr Tyr Gin Arg Ser Val Ala Asn Val Leu 
245 250 255 

Tyr Pro Arg Glu Ser Thr Gly Thr Val Ser Asn Thr Asp Met Glu Met 
260 265 270 

lie Asp Ser Ala Leu Lys Gly lie Leu Arg Arg Thr Lys Gly Lys Lys 
275 280 285 

Val Leu Lys Gly Asp Leu Asn Asp Thr Pro Pro Val Met Leu Leu Leu 
290 295 300 

lie His Met Cys Gly Tyr Arg Lys Trp Ala His Thr Asn Gly Arg Lys 
305 310 315 320 

Lys Val Arg Gly Ala Leu Cys Val Gly Gly Val Val Thr Pro lie Leu 
325 330 335 

lie Ala Cys Gly Val Pro Leu Thr Ser Pro Gly Phe Asp Pro Arg Met 
340 345 350 

Met Asp Leu Asp His Leu Arg Arg Cys Glu Phe Leu Glu Tyr Asp Met 
355 360 365 

Val Gly Asp Phe Tyr Arg Tyr Lys Phe Glu His Ser Leu Thr Arg Thr 
370 375 380 

Ala Asn lie Leu Leu Pro Cys lie Glu Ala Thr Thr lie Leu Xaa Gly 
385 390 395 400 

Glu Asn lie Asp Phe Arg Pro Ala Arg Asp Tyr Leu Tyr Phe Glu Ser 
405 410 415 

Thr Pro Pro Thr Asp Asp Asn Val Pro Thr Thr Glu Ala Thr Glu Asp 
420 425 430 

Asp Phe Ala Glu Thr Asp Glu Asp Arg Glu Glu Glu Tyr Asp Thr Ser 
435 440 445 

Met Tyr His Phe Ser Glu His Val Pro Pro Ala Gin Glu Ser Lys Ser 
450 455 460 

Leu Ser Glu Ala His Arg Asn Asn Ser Lys Leu Gin Arg Trp Cys Lys 
465 470 475 480 

Lys Gin Asp Arg Leu Leu lie Lys Cys Phe Lys Ala He Thr Phe Leu 
485 490 495 

Thr Asp Lys He Ser Cys Phe Ser Ser Thr Thr Ala He Pro Gin Gly 
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500 505 510 

Glu Arg Pro Gin Asp Met Pro Ser Lys Arg Tyr Asp Ala Pro Gly Pro 
515 520 525 

Ser His His Arg Pro Glu Pro Ser His His Arg Pro Glu Pro Ser Asp 
530 535 540 

Arg Val Val Pro Pro Val Pro Ala Arg His Ser Ser Phe Glu Pro Arg 
545 550 555 560 

Glu Leu Gly Arg Lys Lys Lys Ala Ala Leu Ala Arg Ser Gly Ser Arg 
565 570 575 

Ser Arg Arg Leu Leu Gin Ser Arg Ser Leu Arg Asp Arg Gly Ala Gly 
580 585 590 

Arg Ser Arg Arg Arg Glu Val Glu Tyr His Gin Ser Gly Ala Gly Arg 
595 600 605 

Gly Glu Gly Ala Glu Val Glu Tyr Pro Gin Gly Glu Ala Glu Thr Gin 
610 615 620 

Gin Gly Asp Ser Ser Met Ala Trp Glu Gin Ser Gin Ala Ala lie Asp 
625 630 635 640 

Asp Gin Leu Arg Ser Phe Phe His 
645 



<210> 11 
<211> 28 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 

<400> 11 

tggttcacca gataagctca gtgccctc 28 



<210> 12 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 

<400> 12 

cttcagactg gataggacta ggtgggcg 28 
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<210> 13 
<211> 28 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 

<400> 13 

cgcccaccta gtcctatcca gtctgaag 

<210> 14 
<211> 28 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 

<400> 14 

cgcatcaaac aactaacaac gagggcac 

<210> 15 
<211> 26 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 

<400> 15 

cgataacatc gaccgtattg ctcgcc 

<210> 16 
<211> 28 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 

<400> 16 

aactagctcc catccgtctt cgacatcc 

<210> 17 
<211> 24 
<212> DMA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 

<400> 17 

tgcatcacac cggattggat tgac 



<210> 18 
<211> 28 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 

<400> 18 

tgttcccctg aaccatagca atgagacc 



<210> 19 
<211> 27 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 

<400> 19 

caaacagaca gagtgtggcc caccacc 

<210> 20 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 

<400> 20 

ca a a cagaca gagtgtggcc caccacc 

<210> 21 
<211> 27 
<212> E8SIA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
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Oligonucleotide sequence 



<400> 21 

tgcaaaccca caggaccaag tctaccc 



27 



<210> 22 
<211> 27 
<212> OSIA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 

<400> 22 

acagatggtg atagcgtgag cggtggc 27 



<210> 23 
<211> 26 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide sequence 



<210> 24 
<211> 27 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide sequence 

<400> 24 

tctccatcca cgctttcctg aatgtcc 27 



<210> 25 
<211> 29 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide sequence 



<400> 23 

tcaacctttt gccccaacaa ccactc 



26 



<400> 25 

ggagaaggaa gctgaaaatc atattgtgg 



29 
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<210> 26 
<211> 29 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide sequence 

<400> 26 

atgatgatcc taagtctacc cttttgcac 29 



<210> 27 
<211> 3122 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 27 

ttcatatatt cgacctcttc ttctcattct tgcatccaaa agacacaaca agccgccatc 60 
gctttccctc acaactctca ctcgaccacc gcccgctctc tcacttactc ggcttcatcg 120 
ctctcatcgc catctctcaa catactcgac ctcgcgatat cactcgagct cgccgcttct 180 
caccgcctct ccatcgtcac cgcctgctcc ctctctccaa ggaaacaact cgagctctcc 240 
atttcactca ctcgacctct accaccaagc cggcttcacc acttctagct cttaaccact 300 
cgaccacctt caccatcaac caatcaaatc gttttctcct ccattaaagc ttgacatact 360 
cgaccgctga acacttatca cc ttcaagct cctcatctct tcatcgtttc caacaccgct 420 
gctctcatcc cccacgaaag cttgtcatca cctctcactc atcaccagtt cactcgattc 480 
agcaaccaaa ctcgacctcg tctctcttgc cactcatagt cactcgatct ctcctcacca 540 
tcttcatcat ctcccttact cgaccaccgt gcgtctcgct ccaccattgc catttaaaag 600 
ctcactcgat tgtcaaagag aagaagagtg aagctcaacc accgccactc gaccgcgttt 660 
ccctctacac attcaacact cgaccacggt gctaccatct ccacacccgc tcttgttcac 720 
catacactcg accaacaact ctcaaagtaa aaaaaaaaag aaaaaaaaag tcaaaaccga 780 
cagtttcact caaccggttt actcgaccgg tacgctggtt tagattgtgt ttttggtttt 840 
gctattacta acatattaac gtttatcttt gagtttcgtc tgtttttagg tttcatcatg 900 
agtaactaca gtggaaaatc ctctatggac cctgattata atgtggatga agctaagtcc 960 
tggtccacta gaccggagtg agagcaacat gtttacgaga gctataggga tgaatttgaa 1020 
cgctctgcag ctcgacgtaa tcaaagaaga gctgaaatcg ctagaggaaa gagggcgatg 1080 
tcgagtagat atgagctgat tgatgaggat atcaaaactg agtatgagcc agagtcatgg 1140 
cgcaaggaga cgaagctact gaacaaatcc gacgaggtta cagtggagga gtatatcaga 1200 
ttctttgaga tgaatgactt ctggggaacg aggtatccct gatatgagac tttagcccag 1260 
ttggggttac tggaggacgt gcagcatctg ttcgagaagt gtcatctgat aaggaggaga 1320 
caatcgagtt tctttccaca ctgcaagtgg aaatgtatga gggactcaca gactttgagc 1380 
tggataccat ggggttaggc ttcttgacgt tcttagtgga tgaacagcgg taccagattt 1440 
agatcaagaa attggaagaa ctgtttggtt tccctagtgg aaagggaacc aaccccaggt 1500 
ttgacaggga agagcttaag gatttgtggg ctactattgg gaacaatcta ccgctaaact 1560 
cgacgcggtc caagagcaac caaatccgga gtcctgtgat tcgctacttt cagcgctcgg 1620 
ttgccaatgt tttttactcc agggagtcta caggcaccgt gtctaacaca gacatgaaga 1680 
tgatagattc agcgcttata gggattctcc gccttacaaa aggaaagaat gtcctgagag 1740 
gagatcttaa cgactcacca ccagtaatgc ctctgttgat ccatctgtgt gggtacatga 1800 
agtgggcgct gacaaacggc aagaagaagg taagaggagc actatgcgtg ggtggcgttg 1860 
tgacgccaat tctgaaagtt tgtggagttc cgctcaagga agtagggtta gcaccgagaa 1920 
tgatggactt ggatcacttg cgccgatgtg agttctctga gtttgacatg gttggcgact 1980 
ttcaccgcta caggttcgag cattcatcga ttagaatcgc caacattctt ttcccctgca 2040 
tttacgctac taggattctc gagggcagga acattgactt caagcctgcg cttgaagatc 2100 
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tttatttcga gggcagtccg ccaactgagg agattagtca caccgaagga gctacaatag 2160 
aagatgttga tgagacatat gatatagatg aggcggagtt tgacacgagc atgtatcatt 2220 
tcagtgagca tatacctcca gcgaggaaaa gcaagagttt gagcgaagct cacaggaaca 2280 
acagcaagct gcagaagtgg tgcaagaaac aggataagtt actcgccaag tgcctcaggg 2340 
ctatcaagtt tctgaaggac aagatcagct gc tec tefcte cactacaact attcegcaat 2400 
gacagctccc tcaggacatg ccttcgagga gatatgaege gcccgagcct agagagcaga 2460 
agattctgea tgtccctgcg aggcattcat cattcgagcc tegtgaatet aggaagaata 2520 
ggagaacgac actcactcga tctagcagca ggagcagacg acttctgeag tetegtagtt 2580 
tacgcgaccg cggtgctggc cgcaatagaa gaagagaggt cgagtatcct cagageggtg 2640 
ctggccgcca cagagctgat gagatcgagt acccacatgc tggagctgat aeggaacatg 2700 
gcggttcgtc tatggcttgg gagcaatcac aggcagecat tgactaccaa ettegttcat 2760 
tattcgactg aggtaagege ctcacttcac cattatatta tatcatctct tgtgatttgt 2820 
tctttatttt gtttcagtga ttggatttgt cctgagtact ctcttccaag tttattcaca 2880 
cagtggactg tgtgatttaa gtttggggga gggctcagga agtatgttgc attgtatata 2940 
tttttaagtc tgeattcate taaggcatag aaaaaccaaa aaaaaattaa aaatttcaga 3000 
aaatgatttc acaaaaaaag agtgttcatg tagttgeatt acatttagga tcaagtctag 3060 
agtgtttcat ttaggattgt tgeatatgea taggggataa tgatgagata gccttgtaag 3120 
ca 3122 
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SEQUENCE LISTING 



<110> Syngenta Participations AG 



/ 

JC10 Rec'd PC77PTO 
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<120> Transcriptionally Silenced Plant Genes 

<130> S-31147A 

<140> 
<141> 

<150> GB 9921964.4 
<151> 1999-09-16 

<160> 26 

<170> Patentin ver. 2.1 

<210> 1 
<211> 2512 
<212> DNA 

<213> Arabidopsis thai i ana 



<400> 1 

tcgccgttcc 

ttcgcctcaa 

cactcgacca 

cattcactgc 

tctaaattcg 

ctatacccga 

ggtttagtgt 

ttttcaggga 

cgtcgatgaa 

cttcagagcc 

tagaggaaag 

gtatgagcct 

agtggaagag 

ttatgagact 

ccatcttgag 

tctctccact 

agggttggga 

cttggaagga 

agagttgaag 

taagagcaac 

tctgtacccc 

tgcactcaag 

tgatacacca 

caccaacggg 

tctgattgca 

agatcacttg 

caaattcgag 

aaccatactt 

gagcactcca 

tgagacggat 

cgtacctcca 

gcagaggtgg 

tctaacggac 

tcaggacatg 

aagtcaccac 

atcattcgag 

caggagtaga 

aagaagagag 



agcttcgcat 
catcttcgcc 
ccggaccggc 
gcctccattc 
ccgtttactc 
agccacaata 
gtgtgtttat 
atcaatatga 
gctgaatctt 
gagacccaac 
agagcgatga 
gagtcatggc 
tacatcagac 
ctagcccagc 
acgctgatgt 
ctgcaagtgg 
ttcttgactt 
ttatttggtt 
gatttgtggt 
cagattcgaa 
agggaatcta 
ggtattctcc 
ccggtcatgc 
aggaagaagg 
tgtggtgtac 
cgtcgttgtg 
cactccctga 
tagggtgaga 
ccgactgatg 
gaggataggg 
gcgcaggaga 
tgcaagaaac 
aagataagtt 
ccttcgaaga 
aggcctgagc 
cctcgggagc 
cgacttctcc 
gtcgagtatc 



actctctcac 
ggagtttctc 
ttcaccatct 
gtctctttac 
gaccacactg 
tcactctact 
ttgaactaac 
gcaactacag 
ggtcaactag 
gctcagtagc 
ccagcagata 
acagagaaac 
ttttcgagct 
ttaggctact 
cttacccgta 
agttgtatca 
tttcagttaa 
ttcccagtgg 
taaccattgg 
gccctgtgat 
caggcaccgt 
ggagaacaaa 
ttctgttgat 
tgcgaggagc 
ctctcacgtc 
agtttctgga 
cccgaacagc 
acattgactt 
acaatgtccc 
aggaggagta 
gcaagagctt 
aagataggtt 
gcttctcttc 
gatatgacgc 
ctagtgaccg 
tcgggagaaa 
agtcccgtag 
atcagagcgg 



cgtctctcgt 
gccattgtcc 
ctcaactatc 
tcgactgctc 
ttacgtctct 
cgaccgtttc 
atattgatat 
tggcgaatcc 
accagagaga 
tcgacgcaat 
tgagttgatc 
aaaactgttg 
gaacgacttc 
ggaggacgta 
cgtcgcttac 
gggacttact 
cgagcagcgt 
aaagggaact 
gaacgatttg 
ccgctactat 
gtctaacaca 
ggggaagaag 
ccatatgtgt 
cctttgtgtg 
tccagggttt 
gtacgacatg 
caacattttg 
cagacctgcg 
tacgacggaa 
tgatacgagc 
gagtgaagct 
acttatcaag 
taccacagct 
gccagggcca 
agtagtacca 
gaagaaggct 
cttacgcgac 
tgctggccgc 
Page 



ttcactcgac 
gtgcttccgt 
caccattcac 
ctcaaaccgc 
cattcgtgta 
actcgatcgc 
ttggttttga 
tccatggatg 
gagcaacagg 
gaaaggagag 
gacgaagata 
aacaagcctg 
tggggagcga 
cagcacttat 
aagaaggaaa 
gcagatgaac 
taccagctat 
aaacccaagt 
gcgctcaact 
cagcgctcag 
gacatggaga 
gtcctaaagg 
ggatacagga 
ggtggcgttg 
gatccgagga 
gttggcgatt 
cttccctgca 
cgtgattacc 
gctacagagg 
atgtatcatt 
cacagaaaca 
tgcttcaagg 
attccgcagg 
agtcatcaca 
ccagtccctg 
gcactcgcta 
cgcggtgctg 
ggcgaaggag 
1 



cacttcacac 
catctccgtt 
tcgacctcgc 
caccgtcttc 
cagtcgaccg 
gtacttgact 
gttacattct 
cggattacaa 
cttatgagag 
ctgagattgc 
ttgacgtcga 
atgaagttac 
ggtacccctg 
tcgagaagtg 
caatagagtt 
tggagagtga 
ctatcaagag 
tcgaaaggga 
ctgcaaggtc 
tagcgaatgt 
tgattgattc 
gcgaccttaa 
agtgggcgca 
tgacaccgat 
tgatggattt 
tctatcgcta 
tcgaggccac 
tctactttga 
atgattttgc 
tcagtgagca 
acagtaagtt 
ccatcacgtt 
gagagcgtcc 
ggcctgagcc 
caaggcattc 
ggtctggcag 
gccgcagcag 
cagaggtcga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 
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gtacccccag ggggaagctg agacacaaca 
acaggcagct attgacgacc aactccgctc 
accattgtaa tataccatct cttgtttttt 
gtactctctt ccaaatttgg tcacacagtg 
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gggagattct tcgatggcct gggagcaatc 2340 
cttcttccac tgaggtatgc acctcactcc 2400 
attttgtttt tgtgatgtgt tttgtcctga 2460 
gactgtgtga tttaagtttg gg 2512 



<210> 2 
<211> 1997 
<212> DNA 

<213> Arabidopsis thai i ana 
<400> 2 

ccgggcaggt caacaggctt atgagagctt 
acgcaatgaa aggagagctg agattgctag 
gttgatcgac gaagatattg acgtcgagta 
gctgttgaac aagcccgatg aagttacagt 
cgacttctag ggaacgaggt acccctgtta 
ggacgtacat cacttattcg agaagtgcca 
cgcttacaag aaggaaacaa tagagtttct 
acttactgca gatgagctgg agagtgaagg 
gcagcgttac cagctatcta tcaagagctt 
gggaactaaa cccaagttcg agagggaaga 
cgatttggca ctcaactctg caaggtctaa 
ctactatcag cgctcagtag cgaatgttct 
taacacagac atggagatga ttgatgttgc 
gaagaaggtc ctaaagggcg accttaatga 
cctgtgtgga tacaggaagt gggcgcacac 
ttgtgtaggt ggcgttgtga caccgattct 
agggtttgat ccgaggatga tggatttaga 
cgacatggtt ggcgatttct atcgctacaa 
cattttgctt ccctgcatcg aggccacaac 
acctgcgcgt gattacctct actttgagag 
gacggaagtt acagaggatg atattgctga 
tacgagcatg tatcatttca gtgagcacgt 
tgaagctcac agaaacaaca gtaagttgca 
tatcaagtgc ttcaaagcca tcacgtttct 
cacagctatt ccgcagggag agcatcctca 
agcgccaagt catcacaggc ctgagccaag 
agtcccacca gtccctgcaa ggcattcatc 
gaaggctgca ctcgctcggt ctggcagcag 
acgcgaccgt ggtgctggcc gcagcagaag 
tggccgcgac gaaggagcag aggtcgagta 
gattcttcga tggcctggga gcaatcacat 
ttccactgag gtatgcacct cactccacca 
gtttttgtga tgtgttttgt cctgagtact 
tgtgatttaa gtttggg 



tagagctgag acccaacgct cagtagctcg 60 
aggaaagaga gcaatgacca gcagatatga 120 
tgaacctgaa tcatggcaca gagaaacgaa 180 
agaggagtac atcagacttt tcgagctgaa 240 
tgagacttta gcccagctgg ggctactgga 300 
tctggagacg ctgatgtctt acccgtacgt 360 
ctccactctg caagtggaga tgtatcaggg 420 
gttggggttc ttgacttttt cagttaacga 480 
ggaaggatta tttggttttc caagtggaaa 540 
gttgaaagat ttgtggttaa ccattgggaa 600 
gagcaaccag attcgaagcc ctgtgatccg 660 
gtaccccagg gaatctacag gcatcgtgtc 720 
actcaagggc attctccgga gaacaaaggg 780 
tacaccaccg gttatgcttc tgttgatcca 840 
caacgagaag aagaaggtgc gaggagccct 900 
gattgcatgt ggtgtacctc tcacgtctcc 960 
tcacttgcgt cgttgtgagt ttctagagta 1020 
attcgagcac tccctgaccc gaacagccaa 1080 
catacttcag ggtgagaaca ttgacttcag 1140 
cgctccaccg actgatgaca atgtccctac 1200 
gacggatgag gatagggagg aggagtatga 1260 
acctccagcg cgggagagca agagcttgag 1320 
gaggtggtgc aagaaacaag ataggctact 1380 
gacggacaag ataagctgct tctcttctac 1440 
ggacatgcct tcaaggagat atgacgcgcc 1500 
tcaccacagg cctgagccta gtgaccgagt 1560 
attcgagcct cgggagctcg ggagaaagaa 1620 
gagtacacga cttctacagt cccgtagctt 1680 
aagagaggtc gagtatcatc agagcggtgc 1740 
ccccacgggg aagctgagac acaacaggga 1800 
gcagctattg acgaccaact ccgctccttc 1860 
ttgtaatata ccatctcttg tttttatttt 1920 
ctcttccaaa tttggtcaca cagtggactg 1980 



<210> 3 
<211> 1682 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 3 

cttatattat gggttgggat gtgtttaaag 
gggaaagaat tctaggggaa gtaagctaaa 
ggaatgttaa agaaaagaat gaggttcttg 
ggttttaaag attaaaaaaa aaacaggaac 
gatgtatcaa gagcgttgag aaagcttctc 
tgaaaaagag tttgaaagat tcatgagtgc 
ggttttacca ttagaacttc attgttatac 
cataatttgg gacttacctt tagcattcta 
tgttacttaa gcctattcta taagggacca 



aaaaggggga attcattgtt gataaggaaa 60 
gaagttagaa aaaatctagt aaaggttttg 120 
ttagctaaag aataagggtt aaaagccttt 180 
cttagttgtt aaagaaatcc aaacccgcta 240 
ctagagttaa gagaaaagaa aagaatgata 300 
aaagggtaga gttaagttgg gacaggagtt 360 
tctgggtaga tgggatctta tctctgtatg 420 
ctaaagctca atcattcttg agggatcccc 480 
tctttgtctc ttgaccttca ccttggccga 540 
Page 2 



atgagttcat tgatgatgca ttgcttgatt 
attggtagat ttgaaagcat gtgtaggtcg 
gcatggctaa cgtttttgag tagaattcaa 
acattgattt tatttgctct atcatatgct 
ccttcaacta tgtcttctta tttgcttgag 
atatgtctat aatttgcatg ttttcagtgt 
tatcattcat cactgtttta tatcatttct 
ataactttgc atacatattg catttctgag 
agcaaattgg aagaaatgag ccagaaccag 
tttggggcca ttcttcccac atactcggcc 
cccatccact cgaccaccgg gtcgagtaac 
gaccccaggt cgagtatcac ttcgccacac 
caagttactc gaccccctgg tcgagtatca 
gacactcgat cacgtcttca cagtctactc 
caaggaagag aagaggagaa gacaaagtgc 
cgaagcccat ctcggcccat tatctctcta 
tactatcatt ttatttcgtt ttgtataaat 
tagtcgacat tgagtttttt ttgcttcagt 
ttttgcttct gcaacctgta attcgagatt 
tt 
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cgcgttccag aactaatgaa tgttaaaggg 600 
agtataagag acggattgat tgaaaacaag 660 
tcatatcgca tcttagaact accaacttgg 720 
ttggttttga gtccccgcct tcactcctct 780 
ggcaagcaaa gactaagttt gagggagttg 840 
ccattcatca tcgttttgag tccagtttcg 900 
catcattctt gcatactttg catgattagg 960 
ttgttttcag gtgatttgga gctgtttgca 1020 
aagacatact cgacccctag gtcgagtgac 1080 
cccaggtcga gtgactttgg agccattctt 1140 
cttagctcag gccactcgat gacactactc 1200 
cacctgacaa cactcgacca atcactctac 1260 
tcactcacca ccatcagcat cactcgaccg 1320 
aaatccgcag tcaaccagac aagctgagca 1380 
ttggaagcgg cctggacctc catcggatca 1440 
tgggccgagc gattaggtta ttggcccgtc 1500 
agatgtctta gggttttgtc ctgagacatc 1560 
tttattttct gttctactct gctgcgccgc 1620 
tttccaagtt attcagattc cgcatttgat 1680 

1682 



<210> 4 
<211> 1652 
<212> DNA 

<213> Arabidopsis thai i ana 
<400> 4 

cttatatcat gggtttggat cagtttaaaa aaaaaaaagg gtgaattcat tgttgataag 60 
gaaagggaaa gaattctagg ggaagtaagc taaagaagtt agaaaaaaaa aaatctagta 120 
aaggttttgg gaatgttaaa gaaaagaatg aggttcttgt tagctaaaga agaagggtta 180 
aaagcctttt gttttaaaga ttaaaaacag gaaccttagt tgttaaagaa atccaaatac 240 
gctagatgta tcagagtgtt gagaaagctt ctcctagagt taagagaaaa gaaaagaatg 300 
atatgaaaaa gagtttgaaa gattcatgag tgcaaagggt agagttaagt tcttgtattg 360 
ggactggagt tgggattacc attagagctt cattgttata ctatgggtag atgggatttt 420 
atctctgtat gcataacttg ggacttacct ttagcattct actaaagctc aatcattctt 480 
gagagatccc ctgttactta agcctattct gtaagggacc atctttgtct cttgaccttc 540 
accttagcca aatgagttca ttgatgatgc attgtttgat tcacgttcca gaactaatga 600 
atgttaaagg gattggtaga tttgaaagca tgtgtaggtc gagtataaga gacggattga 660 
ttgataacaa ggcatggcta acgttttcga gtaaaattca atcatatcgc atcttagaac 720 
taccaacttg gacattgatt ttatttgctc tatcagatgc tttggttctg agtccccacc 780 
ttcaaacctc tccttcaact atgtcttctt atttgcttga gggcaagcaa agactaagtt 840 
tgggggagtt gatatgtcta taatttgcat gttttcagtg tccattcatc atcgttttga 900 
gtccagtttc gtatcattca tcactgtttt atatcatttc tcatcattct tgcatacttt 960 
gcatgattag gatagctttg tatacatatt gcatttctga gttgttttta ggtgatttgg 1020 
agctgtttgc gagcaaattg gaagaaacga gccagaacaa gaagccatac tcgaccccct 1080 
ggtcgagtga ctttggagcc attcttccca tctactcgac ccgggggtcg agtaacctca 1140 
gctcaggcca ctcgatgacg ccactcgtcc cccctggtcg agtatcactt cgccacacca 1200 
cctgaccaca ctcggccgtt cactctacca cgttactcga ccccctggtc gagtatcatc 1260 
actcaccacc aacaccatca ctcgaccggg cactcgatca catcttcata gtctactcaa 1320 
atccgcactc aaccagacaa gctgagcaca aggaagagaa gaggagaaga caaagtgttt 1380 
ggaagcggcc tggacctcca tcggatcacg aagaagccca tctcggccca ttatcattct 1440 
atgggccggg cgattaggtt attggcccgt ctactatcat tttatttcgt tatgtataaa 1500 
tagatgtctt agggttctgt accaggacat ctagtcgaca ttgagttttt ttgcttcagt 1560 
tttattttct gttttctctg ctgcgccgct tttgtttctg caacctgtaa ttcgagattt 1620 
ttccaagtta ctcagattcc gcatttgatt tc 1652 



<210> 5 
<211> 903 
<212> DNA 

<213> Arabidopsis thai i ana 
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<400> 5 

tactctcttc 

aggaagtgtg 

caaaaaaatt 

attgcattta 

taataatgag 

ttagttgttt 

tctactcgac 

atttagaatt 

attttcttac 

cagttaaccc 

gtgaggtctt 

tgtctcatgt 

tcatgggttg 

agaattccag 

atgttaaaga 

ttt 



caaatttggt 
tgttgcattg 
gaaaaattcc 
ggatcgagtt 
atagccttgt 
gatgcgttgt 
cacactgtta 
atcatgtctt 
tcattttaac 
atacccaaac 
atttcgattg 
agttctagtt 
ggatgtgttt 
gggaagtaag 
aaagaatgag 



cacacagtgg 
tataatcttg 
agaaaatgat 
tagagtgttt 
aagcattttg 
cattgaaatt 
ggatctgata 
ggcatcgaat 
cactcttgtt 
ctgaactttc 
agcttggtag 
tgcgttcttc 
aaaagaaaag 
ctaaagaagt 
gttcttgtta 



actgtgtgat 
agtttgcatt 
ttcacaaaaa 
cgtttaggat 
gttcaccaga 
gaagtaagaa 
tcattcccta 
ttgaactcat 
gatccaagta 
tttcaagccc 
aaagtgttag 
agactggata 
agggaatcta 
tagaaaaaaa 
gctaaagaag 



ttaagtttgg 
catctaaggc 
tagagtgttc 
tgttgcatat 
taagctcagt 
ctgcacgatg 
tcaatttgaa 
ggatacccta 
gctgactctc 
tatatcactt 
gttcgtaacg 
ggactaggtg 
ttgttgatga 
atctagtaaa 
aagggttaaa 



gggagggctc 60 
atagaaaaac 120 
atgtagttgc 180 
gcatagggga 240 
gccctcgttg 300 
cctagattgc 360 
cttgaatctg 420 
aaatacttgg 480 
cttattagag 540 
gtgagtgttt 600 
acagagatag 660 
ggcgcttata 720 
ggaaagggaa 780 
ggttttggga 840 
agcctttggt 900 
903 



<210> 6 
<211> 614 
<212> DNA 

<213> Arabidopsis thai i ana 



<400> 6 

gagggcaagc 

catccatttg 

tgtcatcact 

caggtgattt 

actcgaccca 

cgagtagagc 

tccatcacca 

ccatcatcag 

accactcgac 

gacactcgag 

cacgtgggtc 



aaagactaag 
tcatcacttt 
ttgcatgttt 
ggagctgttg 
gaggtcgagt 
acatcaccac 
gacggtcact 
acaaccactc 
tgcatactcg 
cacaaggaag 
gagt 



tttgggggag 
agcatcatat 
aggatagttt 
acgagctatc 
aggagcttca 
ttcacctcac 
cgatcacttc 
gacctcctca 
atgacaagct 
aaaagaagac 



ttgataagtg 
catcactgtt 
tgcatgcatg 
tggaagaagc 
agatctcaag 
cactcgaccc 
actcgacctt 
ctccacctag 
tcagagcctt 
tccagctatt 



tgtattttgc 
ttataccatt 
ttgcatattt 
agacctgatc 
agactactcg 
cgaggtcgag 
gaggtcgagt 
aggtcgagta 
cttaattccg 
cactcgagct 



atgttttgag 60 
tcacatcatt 120 
gcgttgattt 180 
atgacaaacc 240 
acaaccaggt 300 
tgccatcatc 360 
gtcttcacct 420 
tctccatctt 480 
cactcaacca 540 
ctcactcgac 600 
614 



<210> 7 
<211> 1956 
<212> DNA 

<213> Arabidopsis thai i ana 



<400> 7 

ttttggttca 

aatttgaaag 

gataccattc 

atttgaactc 

ttaatccaag 

tttttcaagc 

agaaagtgtt 

ttggactaga 

aaaaaagggg 

gaagcagaaa 

agctattgaa 

gttaaagaaa 

aagagaaaag 

gaggtaagtt 

ctctaggtag 

actaaagctt 

catctttgtc 

attcacgttc 

tcgagcatat 



ccggattaac 
aaaactgaac 
cctatcaatt 
atggataccc 
tagctgactc 
cttatatcac 
aggttcgtaa 
taggactggg 
ttgattcatt 
aggtctagta 
gatgggcaaa 
tccaaatccg 
aaaagaatga 
cttatactgg 
atgggatctt 
aatcattttt 
tcttgacctt 
cagaactaat 
gagtcggatt 



tcagtgctct 
catgcctaga 
tgaacctgaa 
taaaatactt 
tccttattag 
tcgtgagggt 
cgacagagat 
tgggcgctta 
gatgagaaaa 
aaggttttgg 
agccctcggt 
ctagatgtat 
ttagaaaaag 
gattggagat 
atctctgcat 
tgagagatcc 
ttaccttagc 
gaatgttaaa 
gattgatagt 



cgttgctagt 
ttgctctact 
tttgatcttt 
ggattttctt 
agcagttaac 
ttgtgaggtc 
agtgnctcat 
tactttaggt 
ggtaaaagac 
gatttgtaaa 
tttaaaatgt 
caaagtgttg 
ggcttaaagg 
gggattacca 
gcatagtttg 
cctgttactg 
caaatgagtt 
gggattggta 
aaggcatggc 
page 



tgtgtgttgc 
cgaccacact 
aattatcatg 
attcattttg 
ccgaacccaa 
ttattccgat 
gtagttctag 
tgggatgngt 
tctaggtgaa 
aaaaagaaag 
taaaaacagg 
agaaagcttt 
attcatgaat 
ttagagcttc 
ggacttacct 
aagcctattc 
cattgatgat 
gatttgaaaa 
taaagttttt 
4 



gtagtgaatg 
gtcatgatct 
tctgcatcaa 
atcactcttg 
acctagactt 
tcagcttggt 
ttcgcatttt 
ttaaaagaaa 
gtaagataaa 
agttcttgtt 
aaccttagtt 
tcctagagtt 
gcaaagggta 
atctgatata 
ttagcattct 
tgtaagggac 
gcattgcttg 
catgtgtagg 
cagtagaatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 
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cgatcatatc gcagcttaga actatcaact 
gctttggctc tgagtccccg atttcaaacc 
tgagggcaag caaagactaa gtttggggga 
gcatccattt gtcatcactt tagcaccata 
ttgtcatcac tttgcatgtt taggatagtt 
tcaagtgatt cggagctgtt gaagaactaa 
cactcgacct caggtcgagt agacgcttca 
cgagtgtagg acttcaccac ttcacctcat 
gagtcactcg atcacttcac tcgaccccca 
catcactcga tcacacgact ctacctggaa 
acatacttga tgtcgagctt cagagtcttc 
acaaggaaaa aaagaagatt ctagcttatc 
agtacagttc ttaatccgtc tcaatactgc 
ttttgctata agtagcacgt actttacatt 
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tggacattga tttcatttgg tttatctagt 1200 
tcacctctag cttgttctta attgtttgct 1260 
gttgataagt gtgtattttg catgttttga 1320 
tcatcactat tttataccat ttctcatcat 1380 
ttgcatgcat gtggcatatt tgtgttgttt 1440 
ttggaagaag cggacctgat catgccaaac 1500 
cgacctcaac acaccactcg accacctggt 1560 
cactcgaccc cctggccgag taccccacga 1620 
ggtcgagtgt cttcacctcc accacctgac 1680 
gtcgagtatc accatcacca ccactcgact 1740 
tccattccgc actcaaccag acactcgagc 1800 
actcgacctc tcactcgacc acctgggtcg 1860 
gtcgttttga gtattagggt ttcggaatat 1920 
ttcgag 1956 



<210> 8 
<211> 2105 
<212> DNA 

<213> Arabidopsis thai i ana 
<400> 8 

ttagcatttt ggttcactag ataaactcag 
tcaatgaaat tgaagtaaaa ctgcaccatg 
ggatctgata ccattcccta tcaatttgaa 
gccatcgaat ttgaactcat ggatacccta 
cactcttgtt aatccaagta gctgactctc 
ctaatctttc tttcgagccc tatatcactt 
agcttggtag aaagtgttag gttcgtaacg 
cgcgtttttt ggactggata ggactaggtg 
aaaaaaaaaa aagggtgaat tcattgttga 
aagctaaaga agttagaaaa aaaaaaatct 
aatgaggttc ttgttagcta aagaagaagg 
acaggaacct tagttgttaa agaaatccaa 
gcttctccta gagttaagag aaaagaaaag 
tgagtgcaaa gggtagagtt aagttcttgt 
gcttcattgt tatactatgg gtagatggga 
acctttagca ttctactaaa gctcaatcat 
ttctgtaagg gaccatcttt gtctcttgcc 
tgcattgttt gattcacgtt ccagaactaa 
gcatgtgtag gtcgagtata agagacggat 
cgagtaaaat tcaatcatat cgcatcttag 
ctctatcaga tgctttggtt ctgagtcccc 
cttatttgct tgagggcaag caaagactaa 
catgttttca gtgtccattc atcatcgttt 
tttatatcat ttctcatcat tcttgcatac 
attgcatttc tgagttgttt ttaggtgatt 
cgagccagaa caagaagcca tactcgaccc 
ccatctactc gacccggggg tcgagtaacc 
tcccccctgg tcgagtatca cttcgccaca 
ccacgttact cgaccccctg gtcgagtatc 
gggcactcga tcacatcttc atagtctact 
acaaggaaga gaagaggaga agacaaagtg 
acgaagaagc ccatctcggc ccattatcat 
cgtctactat cattttattt cgttatgtat 
catctagtcg acattgagtt tttttgcttc 
gcttttgttt ctgcaacctg taattcgaga 
atttc 



tgccctcgtt gttagttgtc tgatgcatac 60 
cctagattgc tctactcgac cacactgtta 120 
cttgaatctg atttagaatt atcatgtctt 180 
aaatacttgg attttcttac tcattttaac 240 
cttattagag cagttaaccc gaatccaaac 300 
gtgagtgttt gtgaggtctt atttcaattg 360 
acagagatag tgtctcatgt agttctagtt 420 
ggcgcttata tcatgggttt ggatcagttt 480 
taaggaaagg gaaagaattc taggggaagt 540 
agtaaaggtt ttgggaatgt taaagaaaag 600 
gttaaaagcc ttttgtttta aagattaaaa 660 
atacgctaga tgtatcagag tgttgagaaa 720 
aatgatatga aaaagagttt gaaagattca 780 
attgggactg gagttgggat taccattaga 840 
ttttatctct gtatgcataa cttgggactt 900 
tcttgagaga tcccctgtta cttaagccta 960 
ttcaccttag ccaaatgagt tcattgatga 1020 
tgaatgttaa agggattggt agatttgaaa 1080 
tgattgataa caaggcatgg ctaacgtttt 1140 
aactaccaac ttggacattg attttatttg 1200 
accttcaaac ctctccttca actatgtctt 1260 
gtttggggga gttgatatgt ctataatttg 1320 
tgagtccagt ttcgtatcat tcatcactgt 1380 
tttgcatgat taggatagct ttgtatacat 1440 
tggagctgtt tgcgagcaaa ttggaagaaa 1500 
cctggtcgag tgactttgga gccattcttc 1560 
tcagctcagg ccactcgatg acgccactcg 1620 
ccacctgacc acactcggcc gttcactcta 1680 
atcactcacc accaacacca tcactcgacc 1740 
caaatccgca ctcaaccaga caagctgagc 1800 
tttggaagcg gcctggacct ccatcggatc 1860 
tctatgggcc gggcgattag gttattggcc 1920 
aaatagatgt cttagggttc tgtaccagga 1980 
agttttattt tctgttttct ctgctgcgcc 2040 
tttttccaag ttattcagat tccgcatttg 2100 



<210> 9 
<211> 4860 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Virtual TSI 
<400> 9 

tcgccgttcc agcttcgcat actctctcac cgtctctcgt ttcactcgac cacttcacac 60 
ttcgcctcaa catcttcgcc ggagtttctc gccattgtcc gtgcttccgt catctccgtt 120 
cactcgacca ccggaccggc ttcaccatct ctcaactatc caccattcac tcgacctcgc 180 
cattcactgc gcctccattc gtctctttac tcgactgctc ctcaaaccgc caccgtcttc 240 
tctaaattcg ccgtttactc gaccacactg ttacgtctct cattcgtgta cagtcgaccg 300 
ctatacccga agccacaata tcactctact cgaccgtttc actcgatcgc gtacttgact 360 
ggtttagtgt gtgtgtttat ttgaactaac atattgatat ttggttttga gttacattct 420 
ttttcaggga atcaatatga gcaactacag tggcgaatcc tccatggatg cggattacaa 480 
cgtcgatgaa gctgaatctt ggtcaactag accagagaga gagcaacagg cttatgagag 540 
cttcagagcc gagacccaac gctcagtagc tcgacgcaat gaaaggagag ctgagattgc 600 
tagaggaaag agagcgatga ccagcagata tgagttgatc gacgaagata ttgacgtcga 660 
gtatgagcct gagtcatggc acagagaaac aaaactgttg aacaagcctg atgaagttac 720 
agtggaagag tacatcagac ttttcgagct gaacgacttc tggggagcga ggtacccctg 780 
ttatgagact ctagcccagc ttaggctact ggaggacgta cagcacttat tcgagaagtg 840 
ccatcttgag acgctgatgt cttacccgta cgtcgcttac aagaaggaaa caatagagtt 900 
tctctccact ctgcaagtgg agttgtatca gggacttact gcagatgaac tggagagtga 960 
agggttggga ttcttgactt tttcagttaa cgagcagcgt taccagctat ctatcaagag 1020 
cttggaagga ttatttggtt ttcccagtgg aaagggaact aaacccaagt tcgaaaggga 1080 
agagttgaag gatttgtggt taaccattgg gaacgatttg gcgctcaact ctgcaaggtc 1140 
taagagcaac cagattcgaa gccctgtgat ccgctactat cagcgctcag tagcgaatgt 1200 
tctgtacccc agggaatcta caggcaccgt gtctaacaca gacatggaga tgattgattc 1260 
tgcactcaag ggtattctcc ggagaacaaa ggggaagaag gtcctaaagg gcgaccttaa 1320 
tgatacacca ccggtcatgc ttctgttgat ccatatgtgt ggatacagga agtgggcgca 1380 
caccaacggg aggaagaagg tgcgaggagc cctttgtgtg ggtggcgttg tgacaccgat 1440 
tctgattgca tgtggtgtac ctctcacgtc tccagggttt gatccgagga tgatggattt 1500 
agatcacttg cgtcgttgtg agtttctgga gtacgacatg gttggcgatt tctatcgcta 1560 
caaattcgag cactccctga cccgaacagc caacattttg cttccctgca tcgaggccac 1620 
aaccatactt tagggtgaga acattgactt cagacctgcg cgtgattacc tctactttga 1680 
gagcactcca ccgactgatg acaatgtccc tacgacggaa gctacagagg atgattttgc 1740 
tgagacggat gaggataggg aggaggagta tgatacgagc atgtatcatt tcagtgagca 1800 
cgtacctcca gcgcaggaga gcaagagctt gagtgaagct cacagaaaca acagtaagtt 1860 
gcagaggtgg tgcaagaaac aagataggtt acttatcaag tgcttcaagg ccatcacgtt 1920 
tctaacggac aagataagtt gcttctcttc taccacagct attccgcagg gagagcgtcc 1980 
tcaggacatg ccttcgaaga gatatgacgc gccagggcca agtcatcaca ggcctgagcc 2040 
aagtcaccac aggcctgagc ctagtgaccg agtagtacca ccagtccctg caaggcattc 2100 
atcattcgag cctcgggagc tcgggagaaa gaagaaggct gcactcgcta ggtctggcag 2160 
caggagtaga cgacttctcc agtcccgtag cttacgcgac cgcggtgctg gccgcagcag 2220 
aagaagagag gtcgagtatc atcagagcgg tgctggccgc ggcgaaggag cagaggtcga 2280 
gtacccccag ggggaagctg agacacaaca gggagattct tcgatggcct gggagcaatc 2340 
acaggcagct attgacgacc aactccgctc cttcttccac tgaggtatgc acctcactcc 2400 
accattgtaa tataccatct cttgtttttt attttgtttt tgtgatgtgt tttgtcctga 2460 
gtactctctt ccaaatttgg tcacacagtg gactgtgtga tttaagtttg ggggagggct 2520 
caggaagtgt gtgttgcatt gtataatctt gagtttgcat tcatctaagg catagaaaaa 2580 
ccaaaaaaat tgaaaaattc cagaaaatga tttcacaaaa atagagtgtt catgtagttg 2640 
cattgcattt aggatcgagt ttagagtgtt tcgtttagga ttgttgcata tgcatagggg 2700 
ataataatga gatagccttg taagcatttt ggttcaccag ataagctcag tgccctcgtt 2760 
gttagttgtt tgatgcgttg tcattgaaat tgaagtaaga actgcacgat gcctagattg 2820 
ctctactcga ccacactgtt aggatctgat atcattccct atcaatttga acttgaatct 2880 
gatttagaat tatcatgtct tggcatcgaa tttgaactca tggataccct aaaatacttg 2940 
gattttctta ctcattttaa ccactcttgt tgatccaagt agctgactct ccttattaga 3000 
gcagttaacc catacccaaa cctgaacttt ctttcaagcc ctatatcact tgtgagtgtt 3060 
tgtgaggtct tatttcgatt gagcttggta gaaagtgtta ggttcgtaac gacagagata 3120 
gtgtctcatg tagttctagt ttgcgttctt cagactggat aggactaggt gggcgcttat 3180 
atcatgggtt gggatgtgtt taaaagaaaa gagggaatct attgttgatg aggaaaggga 3240 
aagaattcca ggggaagtaa gctaaagaag ttagaaaaaa aatctagtaa aggttttggg 3300 
aatgttaaag aaaagaatga ggttcttgtt agctaaagaa gaagggttaa aagcctttgg 3360 
ttttaaagat taaaaaaaaa acaggaacct tagttgttaa agaaatccaa acccgctaga 3420 
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tgtatcaaga 
aaaaagagtt 
ttttaccatt 
taatttggga 
ttacttaagc 
gagttcattg 
tggtagattt 
atggctaacg 
attgatttta 
ttcaactatg 
atgtctataa 
tcattcatca 
aactttgcat 
caaattggaa 
tggggccatt 
catccactcg 
ccccaggtcg 
agttactcga 
cactcgatca 
aggaagagaa 
aagcccatct 
ctatcatttt 
gtcgacattg 
ttgcttctgc 



gcgttgagaa 
tgaaagattc 
agaacttcat 
cttaccttta 
ctattctata 
atgatgcatt 
gaaagcatgt 
tttttgagta 
tttgctctat 
tcttcttatt 
tttgcatgtt 
ctgttttata 
acatattgca 
gaaatgagcc 
cttcccacat 
accaccgggt 
agtatcactt 
ccccctggtc 
cgtcttcaca 
gaggagaaga 
cggcccatta 
atttcgtttt 
agtttttttt 
aacctgtaat 



agcttctcct 
atgagtgcaa 
tgttatactc 
gcattctact 
agggaccatc 
gcttgattcg 
gtaggtcgag 
gaattcaatc 
catatgcttt 
tgcttgaggg 
ttcagtgtcc 
tcatttctca 
tttctgagtt 
agaaccagaa 
actcggcccc 
cgagtaacct 
cgccacacca 
gagtatcatc 
gtctactcaa 
caaagtgctt 
tctctctatg 
gtataaatag 
gcttcagttt 
tcgagatttt 
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agagttaaga gaaaagaaaa gaatgatatg 3480 
agggtagagt taagttggga caggagttgg 3540 
tgggtagatg ggatcttatc tctgtatgca 3600 
aaagctcaat cattcttgag ggatcccctg 3660 
tttgtctctt gaccttcacc ttggccgaat 3720 
cgttccagaa ctaatgaatg ttaaagggat 3780 
tataagagac ggattgattg aaaacaaggc 3840 
atatcgcatc ttagaactac caacttggac 3900 
ggttttgagt ccccgccttc actcctctcc 3960 
caagcaaaga ctaagtttga gggagttgat 4020 
attcatcatc gttttgagtc cagtttcgta 4080 
tcattcttgc atactttgca tgattaggat 4140 
gttttcaggt gatttggagc tgtttgcaag 4200 
gacatactcg acccctaggt cgagtgactt 4260 
caggtcgagt gactttggag ccattcttcc 4320 
tagctcaggc cactcgatga cactactcga 4380 
cctgacaaca ctcgaccaat cactctacca 4440 
actcaccacc atcagcatca ctcgaccgga 4500 
atccgcagtc aaccagacaa gctgagcaca 4560 
ggaagcggcc tggacctcca tcggatcacg 4620 
ggccgagcga ttaggttatt ggcccgtcta 4680 
atgtcttagg gttttgtcct gagacatcta 4740 
tattttctgt tctactctgc tgcgccgctt 4800 
tccaagttat tcagattccg catttgattt 4860 



<210> 10 
<211> 648 
<212> PRT 

<213> Arabidopsis thai nana 

Met°Ser°Asn Tyr Ser Gly Glu Ser ser Met Asp Ala Asp Tyr Asn Val 
1 5 10 15 

Asp Glu Ala Glu Ser Trp Ser Thr Arg Pro Glu Arg Glu Gin Gin Ala 
20 25 30 

Tyr Glu Ser Phe Arg Ala Glu Thr Gin Arg Ser val Ala Arg Arg Asn 
y 35 40 45 

Glu Arq Arg Ala Glu He Ala Arg Gly Lys Arg Ala Met Thr Ser Arg 
50 5 5 60 

Tyr Glu Leu lie Asp Glu Asp lie Asp val Glu Tyr Glu Pro Glu ser 
65 70 75 80 

Trp His Arg Glu Thr Lys Leu Leu Asn Lys Pro Asp Glu val Thr val 
85 90 95 

Glu Glu Tyr lie Arg Leu Phe Glu Leu Asn Asp Phe Trp Gly Ala Arg 
100 ~ 105 110 

Tvr Pro Cys Tyr Glu Thr Leu Ala Gin Leu Arg Leu Leu Glu Asp Val 
y lis 120 125 

Gin His Leu Phe Glu Lys Cys His Leu Glu Thr Leu Met Ser Tyr Pro 
130 135 140 

Tyr val Ala Tyr Lys Lys Glu Thr lie Glu Phe Leu Ser Thr Leu Gin 
145 150 155 160 

val Glu Leu Tyr Gin Gly Leu Thr Ala Asp Glu Leu Glu Ser Glu Gly 
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165 170 175 

Leu Gly Phe Leu Thr Phe Ser Val Asn Glu Gin Arg Tyr Gin Leu Ser 
180 185 190 

lie Lys Ser Leu Glu Gly Leu Phe Gly Phe Pro Ser Gly Lys Gly Thr 
195 200 205 

Lvs Pro Lys Phe Glu Arg Glu Glu Leu Lys Asp Leu Trp Leu Thr lie 
210 215 220 

Gly Asn Asp Leu Ala Leu Asn ser Ala Arg Ser Lys Ser Asn Gin lie 
225 230 235 240 

Arq Ser Pro val lie Arg Tyr Tyr Gin Arg Ser val Ala Asn Val Leu 
245 250 255 

Tvr Pro Arg Glu Ser Thr Gly Thr val Ser Asn Thr Asp Met Glu Met 
y 260 265 270 

He Asp Ser Ala Leu Lys Gly lie Leu Arg Arg Thr Lys Gly Lys Lys 
K 275 280 285 

val Leu Lys Gly Asp Leu Asn Asp Thr Pro Pro val Met Leu Leu Leu 
290 ' 295 300 

lie His Met Cys Gly Tyr Arg Lys Trp Ala His Thr Asn Gly Arg Lys 
305 310 315 320 

Lys Val Arg Gly Ala Leu Cys Val Gly Gly Val val Thr Pro He Leu 
325 330 335 

lie Ala Cys Gly val Pro Leu Thr Ser Pro Gly Phe Asp Pro Arg Met 
340 345 350 

Met Asp Leu Asp His Leu Arg Arg Cys Glu Phe Leu Glu Tyr Asp Met 
355 360 365 

val Gly Asp Phe Tyr Arg Tyr Lys Phe Glu His Ser Leu Thr Arg Thr 
370 375 380 

Ala Asn lie Leu Leu Pro Cys lie Glu Ala Thr Thr lie Leu xaa Gly 
385 390 395 400 

Glu Asn lie Asp Phe Arg Pro Ala Arg Asp Tyr Leu Tyr Phe Glu Ser 
405 ~ 410 415 

Thr Pro Pro Thr Asp Asp Asn val Pro Thr Thr Glu Ala Thr Glu Asp 
420 425 430 

Asp Phe Ala Glu Thr Asp Glu Asp Arg Glu Glu Glu Tyr Asp Thr Ser 
435 440 445 

Met Tyr His Phe Ser Glu His Val Pro Pro Ala Gin Glu Ser Lys Ser 
450 , 455 460 

Leu Ser Glu Ala His Arg Asn Asn Ser Lys Leu Gin Arg Trp Cys Lys 
465 470 475 480 

Lys Gin Asp Arg Leu Leu lie Lys Cys Phe Lys Ala lie Thr Phe Leu 
485 490 495 

Thr Asp Lys lie Ser Cys Phe ser Ser Thr Thr Ala lie Pro Gin Gly 
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500 505 510 

Glu Arq Pro Gin Asp Met Pro Ser Lys Arg Tyr Asp Ala Pro Gly Pro 
515 520 525 

Ser His His Arg Pro Glu Pro Ser His His Arg Pro Glu Pro Ser Asp 
530 535 540 

Arq val Val Pro Pro val Pro Ala Arg His Ser Ser Phe Glu Pro Arg 
545 550 555 560 

Glu Leu Gly Arg Lys Lys Lys Ala Ala Leu Ala Arg Ser Gly Ser Arg 
565 570 575 

Ser Arg Arg Leu Leu Gin Ser Arg Ser Leu Arg Asp Arg Gly Ala Gly 
580 585 590 

Arq Ser Arq Arg Arg Glu val Glu Tyr His Gin Ser Gly Ala Gly Arg 
595 600 605 

Gly Glu Gly Ala Glu val Glu Tyr Pro Gin Gly Glu Ala Glu Thr Gin 
610 615 620 

Gin Gly Asp Ser Ser Met Ala Trp Glu Gin Ser Gin Ala Ala lie Asp 
625 630 635 640 

Asp Gin Leu Arg Ser Phe Phe His 

645 



<210> 11 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 

<400> 11 

tggttcacca gataagctca gtgccctc 

<210> 12 
<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 

<400> 12 

cttcagactg gataggacta ggtgggcg 

<210> 13 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: 
Oligonucleotide sequence 

<400> 13 

cgcccaccta gtcctatcca gtctgaag 



<210> 14 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 

<400> 14 

cgcatcaaac aactaacaac gagggcac 



<210> 15 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 

<400> 15 

cgataacatc gaccgtattg ctcgcc 



<210> 16 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 

<400> 16 

aactagctcc catccgtctt cgacatcc 



<210> 17 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 

<400> 17 

tgcatcacac cggattggat tgac 



<210> 18 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
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<220> 



<223> Description of Artificial Sequence: 



Oligonucleotide sequence 



<400> 18 

tgttcccctg aaccatagca atgagacc 



28 



<210> 19 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
01 i gonucl eoti de sequence 

<400> 19 

caaacagaca gagtgtggcc caccacc 27 



<210> 20 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 



<210> 21 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 

<400> 21 

tgcaaaccca caggaccaag tctaccc 27 



<210> 22 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide sequence 



<400> 20 

caaacagaca gagtgtggcc caccacc 



27 



<400> 22 

acagatggtg atagcgtgag cggtggc 



27 



<210> 23 
<211> 26 
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<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: 
oligonucleotide sequence 

<400> 23 

tcaacctttt gccccaacaa ccactc 26 



<210> 24 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide sequence 

<400> 24 

tctccatcca cgctttcctg aatgtcc 27 



<210> 25 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide sequence 

<400> 25 

ggagaaggaa gctgaaaatc atattgtgg 29 



<210> 26 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide sequence 

<400> 26 

atgatgatcc taagtctacc cttttgcac 29 



<210> 27 
<211> 3122 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 27 

ttcatatatt cgacctcttc ttctcattct 
gctttccctc acaactctca ctcgaccacc 
ctctcatcgc catctctcaa catactcgac 
caccgcctct ccatcgtcac cgcctgctcc 
atttcactca ctcgacctct accaccaagc 
cgaccacctt caccatcaac caatcaaatc 
cgaccgctga acacttatca ccttcaagct 



tgcatccaaa agacacaaca agccgccatc 60 
gcccgctctc tcacttactc ggcttcatcg 120 
ctcgcgatat cactcgagct cgccgcttct 180 
ctctctccaa ggaaacaact cgagctctcc 240 
cggcttcacc acttctagct cttaaccact 300 
gttttctcct ccattaaagc ttgacatact 360 
cctcatctct tcatcgtttc caacaccgct 420 
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gctctcatcc cccacgaaag cttgtcatca cctctcactc atcaccagtt cactcgattc 480 

agcaaccaaa ctcgacctcg tctctcttgc cactcatagt cactcgatct ctcctcacca 540 

tcttcatcat ctcccttact cgaccaccgt gcgtctcgct ccaccattgc catttaaaag 600 

ctcactcgat tgtcaaagag aagaagagtg aagctcaacc accgccactc gaccgcgttt 660 

ccctctacac attcaacact cgaccacggt gctaccatct ccacacccgc tcttgttcac 720 

catacactcg accaacaact ctcaaagtaa aaaaaaaaag aaaaaaaaag tcaaaaccga 780 

cagtttcact caaccggttt actcgaccgg tacgctggtt tagattgtgt ttttggtttt 840 

gctattacta acatattaac gtttatcttt gagtttcgtc tgtttttagg tttcatcatg 900 

agtaactaca gtggaaaatc ctctatggac cctgattata atgtggatga agctaagtcc 960 

tggtccacta gaccggagtg agagcaacat gtttacgaga gctataggga tgaatttgaa 1020 

cgctctgcag ctcgacgtaa tcaaagaaga gctgaaatcg ctagaggaaa gagggcgatg 1080 

tcgagtagat atgagctgat tgatgaggat atcaaaactg agtatgagcc agagtcatgg 1140 

cgcaaggaga cgaagctact gaacaaatcc gacgaggtta cagtggagga gtatatcaga 1200 

ttctttgaga tgaatgactt ctggggaacg aggtatccct gatatgagac tttagcccag 1260 

ttggggttac tggaggacgt gcagcatctg ttcgagaagt gtcatctgat aaggaggaga 1320 

caatcgagtt tctttccaca ctgcaagtgg aaatgtatga gggactcaca gactttgagc 1380 

tggataccat ggggttaggc ttcttgacgt tcttagtgga tgaacagcgg taccagattt 1440 

agatcaagaa attggaagaa ctgtttggtt tccctagtgg aaagggaacc aaccccaggt 1500 

ttgacaggga agagcttaag gatttgtggg ctactattgg gaacaatcta ccgctaaact 1560 

cgacgcggtc caagagcaac caaatccgga gtcctgtgat tcgctacttt cagcgctcgg 1620 

ttgccaatgt tttttactcc agggagtcta caggcaccgt gtctaacaca gacatgaaga 1680 

tgatagattc agcgcttata gggattctcc gccttacaaa aggaaagaat gtcctgagag 1740 

gagatcttaa cgactcacca ccagtaatgc ctctgttgat ccatctgtgt gggtacatga 1800 

agtgggcgct gacaaacggc aagaagaagg taagaggagc actatgcgtg ggtggcgttg 1860 

tgacgccaat tctgaaagtt tgtggagttc cgctcaagga agtagggtta gcaccgagaa 1920 

tgatggactt ggatcacttg cgccgatgtg agttctctga gtttgacatg gttggcgact 1980 

ttcaccgcta caggttcgag cattcatcga ttagaatcgc caacattctt ttcccctgca 2040 

tttacgctac taggattctc gagggcagga acattgactt caagcctgcg cttgaagatc 2100 

tttatttcga gggcagtccg ccaactgagg agattagtca caccgaagga gctacaatag 2160 

aagatgttga tgagacatat gatatagatg aggcggagtt tgacacgagc atgtatcatt 2220 

tcagtgagca tatacctcca gcgaggaaaa gcaagagttt gagcgaagct cacaggaaca 2280 

acagcaagct gcagaagtgg tgcaagaaac aggataagtt actcgccaag tgcctcaggg 2340 

ctatcaagtt tctgaaggac aagatcagct gctcctcttc cactacaact attccgcaat 2400 

gacagctccc tcaggacatg ccttcgagga gatatgacgc gcccgagcct agagagcaga 2460 

agattctgca tgtccctgcg aggcattcat cattcgagcc tcgtgaatct aggaagaata 2520 

ggagaacgac actcactcga tctagcagca ggagcagacg acttctgcag tctcgtagtt 2580 

tacgcgaccg cggtgctggc cgcaatagaa gaagagaggt cgagtatcct cagagcggtg 2640 

ctggccgcca cagagctgat gagatcgagt acccacatgc tggagctgat acggaacatg 2700 

gcggttcgtc tatggcttgg gagcaatcac aggcagccat tgactaccaa cttcgttcat 2760 

tattcgactg aggtaagcgc ctcacttcac cattatatta tatcatctct tgtgatttgt 2820 

tctttatttt gtttcagtga ttggatttgt cctgagtact ctcttccaag tttattcaca 2880 

cagtggactg tgtgatttaa gtttggggga gggctcagga agtatgttgc attgtatata 2940 

tttttaagtc tgcattcatc taaggcatag aaaaaccaaa aaaaaattaa aaatttcaga 3000 

aaatgatttc acaaaaaaag agtgttcatg tagttgcatt acatttagga tcaagtctag 3060 

agtgtttcat ttaggattgt tgcatatgca taggggataa tgatgagata gccttgtaag 3120 
ca ~~ ~ 3122 
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